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PURIFIED ST AT PROTEINS AND METHODS OF PURIFYING THEREOF 

r.nvFRNMF.NTAL SUPPORT 

5 The research leading to the preser^t invention was supported, at least tn part, by NIH 

Grat^c Nos. A132489 ar,d AI34420. Accordtt^gly, the Governtner.t may have certatn r.ghts 

in the invention. 

Pr-.-. o.:np,.=Nrp TO REI ftTFD APPI ICATIONS 

. '\ T.e present App.ca.on ,^based upon pro.is.ona. appUenUon U.S. Serial No. 60,028,176, 
■ '^'^ ' ' nied Ocober 15. 1996,,,Kf disclosure of which Is hereby l„corpora,ed by reference ,„ us 
\ I 1 ;Vr e«lre,y. Applicants claim .V benefus of .his AppUcaiton under 35 U.S.C. § U9(e). 

/ 

QPj_P r^F THF INVENTION 

The presen, ,nven.,on relates generally ,o me,ho4s of purify ing recomb.nan, Stat proteins, 
.odtfied Sta, protelr^ and h^nctiona, fragments thereof. Included in the presen. .n.ent.on 
are .he puriHed pro.etns and fragn,en.s .hentselves. The presen. .nventlon also relates to 
20 me,h„ds of separating phosphorylated species of these proteins and ftagments from .he 

nonphosphory.a.ed fomts. The presen. ,n,ention also relates to methods for us.ng purtHed 
Stat proteins, truncated Stat protetns or N-termlna. fragments of Sta. prote.ns for drug 
discovery. 

PArtcr.ROllNr '^CTHF INVF.NTION 

Transcription factors play a major role in ceilu.ar Unction by inducing the transcription of 
specific mRNAs. Ttanscrlp.ion fac.o.s, .n turn, are controlled by dist.nc, signalling 
molecules. One particular fam.ly of transcription factor consists of the Signal Tran«iucers 
30 and Activators of Transcnption (Stat, ptotetns. Presently, there are seven Rnown 

manunalian Stat family members. The recent discovery of Drosophlla and Die yostei.um 
discoideum Stat prote.ns suggest .ha. S.a. proteins have played an '-PO-' 
transduction s.nce the early siages of our evolution ,Van R. e, Ce« 84:421-430 
(1996); Kawata et al. Cell 89:909 (1997)]. 

35 

f lora^ orniin of signalling molecules including the 
Stat proteins mediate the action of a large group ot si^na g 
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second messenger, . , AMP ^,„3,,„„3,ion o„ , single „™si„e 

S,.t prote,ns are acuva.ed ,n .he cy P ^^^^ ^^^^ 

11 . ^/ iQQd Schindler and DarnelK 1^^^)- i^^^ ^ f 
5 (Darnell ./„ 1994. Sch _^ ^^^^^^ ^^^^^^ .^.^^ 

Ugand-activated transmembrane re eptors ^^^^^^ ^ 

EGF- or PDGF-receptors, or cytokme receptors that lack m 

d JAK ktnases such as those for tnterferons and mterleuktns (Ihle, 1995). Whe 
associated JAK kmases, bu .^.^Himeric structures in which 

S,at prceins are phosphory.a.ed, .he, torn, homo- or hererod.m r.c s m 
,0 ..e pUo.,r„s,„e o, one panner .nds .o r.e SKC — '^-^^^^ 
o.er. The new. ,ormea ..er ..en "J /j'^; 1 . 

GAS sequence, thereby acivating .ranscnprion (Shua, « aL. 

Leung a/., 1996). 

„ ,he caoacUy as a direc. messengers belween rhe cy.olcme or growth 
,5 Stat proterns serve ,n capa ^^^^ 

factor receptor present on the • ^ ^^.^ 

cyt„..ne and growth factor produce a specf.c " ' 

. :::: t;:::: ::::: .nown —.an s.. pr„.e,ns .ame. « , 

277:1630-1635 (1997)]. 

• H to further study the biochemistry of Stat proteins. Unfortunately 

tag at the COOH term production of milligram 

; ^^f^ Q MQQ6^ However, no one has repoucu ^ 

S,. pro.e.n, no. .ore ,„,por.a„.,y. a pur..ca.,„n process amenah.e 

30 to scaling up for such quantitative isolations. 

TO perform the hiochemlca. s.ud.es necessary .o undersrand the ntechanisn, of dre Stat- 



p.„a.cuo„ of ,a,g= a.o„„.s of pure pro.e.n. Fu„h.™o,e, .her. is a need for a ™a» 
specifca,,, pho,ph»ry,a,i„g .he correc. tyrosine residue on a S>a. prore.n and .hen 
separ.„ng the resulr.ng phosphoryiated S.ar pro.ein fron, ,he „„pl,ospi,or,.a,ed forn, ,n 
nuamuarive yields^ In addUion. there i, a need to produce large =,„ant,t,es of stable, 
5 solubie truncated Stat proteins that retain ^nctiona, acttvit.es of the corresponding natt.e 
Sta, protein. Finally, there is a need to develop methods of isolating these funconal 
truncated Stat proteins. 

The citation of any reference herein should not i. construed as an admission that such 
,0 reference is availalale as "Prior Art" to the it«tant applicat.on, 

c,.K.»4>PY OF THF TNVF.NTION 

The present invention describes recombinant hunran Stat proteins whtch are produced In 
15 insect ceils tnfected with recombinant baculovlms. Stable truncated forms of these 
proteins produced in bacteria are also Included in the present invention. The present 
invention also includes labeled recombinant human Stat protetns and truncated Stat 
protetns. One .spec, of this invention Includes the purifcation of large amounts of these 
ecomblnant prote,.. These isolated Stat proteins can be isolated in enher their acttvated 
20 fontt having a phosphorylated tyrosine, or tn the nonphosphorylated state, where 
corresponding tyrosine res.due is not phosphorylated A related aspect to the tnvent.on 
details the protease sensttivi.y of Stat proteins and the important conse,uences of thts 
particular property. The present inventton exploits this property and describes a 
recombtnant tmncated Stat protein that can be expressed in a bacteria, host ,n large 

-n^^ fi^Qr-hina of the On 
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recomDinani iruui^dicu -j^o.^ f^-'- - , , t 

ouantities, as a soluble protein that can be readtiy purified b, the teaching of the presem 
invention. The phosphory lated and nonphosphorylated form of the tttmcated .ta, protein 



can also be individually isolated 



30 



The expression of the truncated protein in a soluble form overcomes earher fatlures, 
Where recombinant Stat proteins formed almost exclusively insoluble inclusion bodies. 
Other potentially active fragments of Stat protetns that contain the DNA b.nd.ng domatn 
either form tnsoluble tncluslon bodies or are themselves so suscepttble to proteolysts that 
isolation Of the large quantlttes necessary for btochemical studies are not pract.ca Thus 
the ptesent tnvent.on teaches for the firs, t.me, a soluble recombinant truncated Stat 
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protein, as well as methods of its expression and isolation. 

Although the present .nvention .nc.udes all Stat prote.ns, when specif.c amino ac.d 
"Is are L.Oed nnm.er, the numher represents the se.uenttal pos.non of that 
5 amino actd m the am.no ac.d sequence of Static. Thus, the number denoted for a 
specned amtno actd tn Static and Statltc, as used herein, ts per tts correspondm. 
position in the amino acid sequence of Stat la. 

The present mventton includes a truncated Stat protetn that can be expressed as a soluble 

K . •ih.stcell m preferred embodiments the bacterial host IS 
[0 recombinant protein in a bacterial host cell. In prerer 

E CO,, a„a .Ke so,„b,e .™nca.d S.a, prcein r^.. up a. ,.a,, 30% of ,he ,o,a, 
eco™.i„an, — S.. p.o,.n p,o.„.. -n a .o. p.,».. ...oa,.» .he so,* 
«d S,a. pro«,n ma.es up a, ,eas, 50% of ch. .o,., recombiuan. fruneafed 
pro,e,„ produced. In one embodimen,. .be .ru„ca,ed S,a, pro,ei„ has an am.no acd 
,5 ,„e„c subs,an,ia„y s™„a, ,o SEQ ,D NO., .n a„o,be, embodi^en. .be ,™„ca.ed 
p ,e.„ bas an annno acid sequence of SEQ ,D NO.. ,n preferred en.bod,„,e„.s. . 
Id Srar pro.e,„ ,s pur.fed. .n one var,ad„n of .bis „pe, .be purged .runcared S,a. 
p:.l e,.ibi,: a si„„e proiein band on SOS-PAOB, run under reducing condnrons. 

.0 Tbe S.a, proteins, inCuding ,b, ,runca,ed S,a. pro.eins of .be presen, .nven.ion are 
!c.,va.ed «ben a .,.os,ne residue of .be pro.ein is pbospbor„a.ed. ,n a preferred 
rrlen. Of .b,: .,pe. .be pbospbor„a,ed .yrosine .s ryros.ne ,0, of .be S.a... abnno 
acid sequence shown in SEQ ID NO;,- 

,5 ,„ one embodimen.. .be purified ,..nca.ed S... prorein is subs.an.ia„y or comp,e.e,y free 
pChorv,a.ed fom, ,n a„„.ber embodrmen., .be purified .runca.ed S.a. prorern is 
bs.an.ia„ or conrp.e.ed pb„spbory,a.ed. ,n ye. a .bird en„bod..e„.. .ne purrfied 
lied L prorein ,s a .ix,nre of .be „o„ph„spbo,y,a,ed and pbospbory,a,ed forms. 

30 one enrbodimen. of .he presen. inven.icn is a purified S.a, pro.e.n .ba. ,s eidrer 
subs.an.ia„y or con,p,e.e„ fiee of ..s correspondins pbospbory a.ed^ '''"^ 
,be a,.erna.,.e. is essen.ia„y or en.ireiy in ,be corresponding P-P--'^ ^ 
f„™ one varia.ion of .bis embodimen. exhibirs a s,ng,e pro,e,n band on 7 SD 
Toe run under reducin, cond,.ions. and bas an amino ac.d sequence subs,an.,a„y 
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. , SEO ID NO- . in ancher varia.ion .he pur.f.ed S.a. prcei,,. .xh.bi.s a si„sl. 

^' r d 1 7y SDS-PAGE, run under reducing condUions, and .a, an anr.no ,c,d 
p,„,e,n band on 7% PA . ,^ ^ 

sequence sub„.m,ally s.nular ,o SEQ LD N0.2 s^s-PAGE run under 

,„„„ed S.a, pro,e>n ,ba, exb.bns a s>ns,e prorcn « ,^-^,,„ 

reducing condirions and has an amino acd sequence of SEQ ID NO 1 

"^n ot .biS e^bodimen,, .be puriHed S.a. pro.ein exbibirs a s.ngie prorein ban on 
;rsDS-PAOE, run under reducing conditions, and Kas an an.no ac.d sequence of SEQ 



ID N0:2. 



• .nd nurified Srat proteins including the purified truncated Stat 
1 ThP truncated Stat proteins and puririea ^i^l yi , , • ^ 

, „e is an a„erna,.ve poiar neu.ra, a.,no acd sucb » — 

allernari.e polar neutral amino acd is a glycme, a senne. o 
Idimen s con.ai.ng modir.ed cysreines. ,be modified cysreine is as .„ al.yla ed 
or a cysrcne conraining a mercurial, or rbe ,b.ol is o.idi.ed and forms a 
20 disulfide bond with a second thiol moiety. 

.iWlated bv a variety of alkylating agents including 
The alkylated cysteines may be alkylated by a . ... ^ ^ , ^ithiobis- 

s s/dithiobis(2-nitrobenzoic acid), 2,2/-ditnioDis 

25 cysteines are alkylated by N-ethyl maleimide. 

The purtned truncated Stat protems and punHed Stat protems. including the purged 

nf the oresent invention, can also have more than one converted 
truncated Stat protems of the present mve cr.„ „ or a fragment thereof 

..steme. In one embodiment of this type, the Stat protein ^ 

• of rv^rHne 155 Cyste ne 440, and Cysteine 4yz ui 
30 and has three converted cysteines ^ C^-^;^^^ / ^^^^^ 
Statla amino acid sequence shown m SEQ ID NU.i. 

W anv form as listed above, including each cysteine taking an alternative form. In 
take any form hste ^^^^ ^^^^ .^^^^^ homocysteine, 

such embodiment Cysteine 155 is alkyiateo, y , ^ .. three 

and cysteine 492 is substituted by a threonine. In a preferred embodiment, a. 
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conver... c,s,=,n=s arc alkylated cysteines. AU of .hese S,a, procelns a„d pur.hed S^^ 
p.„.i„s can purified ro exh.tU one band on 7% SDS-PAGE, under reduaug cond , ns 
n ..her rheir pKosphory.ared, ac.a.ed ..are or ,n .heir eorrespondln, nonp.o.pKory.ared 



form. 



The present rn.enr.on also includes purified S,ar N-.ernrinal peptide fragnrems. These 
pep„de fragrnems consis, of a protein dontain that can be seiectively Ceaved by m, d 
proteolysis wtth subtU.stn o, proteinase The N-terminal peptide fragnrents can orm 
hontodinrers. As part of a Stat protein, the N-ter™,nai donaain serves to enhance the 
,0 binding of two adjacent Stat diners to a pair of c.osel, aiigned DNA bindtng sttes 

bindin sites separated by appro.— y iO to ,S base patrs. .in a preferred e.^d,rr,e t 
the N-Lina, peptide fragnrent has an atnino acid sequence substanttaiiy st.far to that f 
SEQ ID NO:4. In a more preferred embodiment, the N-terminai peptide fragment has an 
amino acid sequence of SEQ ID N0;4. 

" The present invention. a,so tnc.udes antibodies to the truncated Stat protein, and the 
N-,ermina, peptide fragment of a Stat protein, as purified from recombinant sources or 
produced by chemicai synthests. and derivatives or attalogs thereof, includmg tuston 
proteins. Such anttbodi.s include but are not limited to polyclonal, monoclonal, c tmenc. 

20 Lgle Chain, Fab fragments, and a Fab expression library. These antibodtes ntay be 
labeled. 

The present inventton also includes nucleic acids comprising nudeottde sequences that 
encode a truncated Stat protein. In one embodiment the nucleic acid comprtses a 
« nucleotide sequence that encodes a truncated Stat protein having an amino acd sequettce 
tna, ,s substantially simdar to SEQ ID NO:3. .n a related embodiment the nucietc actd 
comprises a nucleot.de sequence that encodes a truncated Stat protei. ^-"^ ~ 
actd sequence o, SEQ ID NOa In ye. another embodimem the nucletc acd comprtses 
nucleotide sequence that is substanttaiiy simtiar to SEQ >D NO:5 and codes tor the 
30 expression of a truncated Stat protein. In sttU another embodtment the nucietc acd 
contains a nucleotide sequence having the sequence of SEQ ID NO:,, 

The present invent.on also includes nucleic acids that comprtse a nucleot.de sequence 
encoding an N-termtnal fragment of a Stat pro.etn. In one embodimen. .be nucietc acd 



c„mpr,s» a nuCecd. sequence m,. encode, a S.a, N-cern,i„al .rag^m having a„ a„„no 
a„a sequence ,ha, is ,ubs.an.ia„y si™,ar ,0 SEQ ,D NO:4. ,n a reia.ed en,bod.n,en. ,he 
nucieic acid con,pr,ses a nnc.ecde sequence ,ha, encodes a S.a, N->e™,na, fragment 
Hav,„g .he amino acid sequence of SEQ ID NO:4. In ye. ano.her embod.men. .he nncle.c 
5 acid compr,ses a nuc.eo.ide sequence .ha, is s„bs.a„,ia»y sim.iar .o SEQ ,D NO:6 and 

„„ „f , s,ai N-.erminal fragment. In s.ill ano.her embodimera ,he 

codes for the expression ot a btat in lermiudi uag 

nucleic acd coraains a nucleotide sequence having ,he sequence ot SEQ ID N0:6. 

A„ of ,he nucleic adds of .he presen. invemion can also contain he,e,o,og„us nuc,eo,ide 

10 sequences. 

Meihods of phosphotylafihg ,he S,a, pro.eins ,„ are a.; included in .he presen. 
i„ven.ion. In one embod.men. .he phosphoryla.ion .s performed wi.h a prepara.ton of 
EGF-receptor kinase. In preferred embodiments the EGF-receptor preparation ,s ob,a,ned 
i5 from cel, lysates and purified w.th the use of an ami-EGF-receptor 

against the extraceilu.ar domain. In some such embodiments the resuittng EGF-receptor 
antibody complex is precipitated with Protein A agarose beads. ,n another preferred 
embodiment the ant.body is a monoclonal a„.ib^y. In yet another preferred embod.ment 
,he cell lysates are from humans. In the most preferred embodiment of th.s method, the 
20 antibody is a monoclonal amibody and the cell lysa.es are from humans. 

The present i.vem.on also includes me.hods of separa.tng phosphoryla.ed S.a. pro.etns 
including phosphorylated truncated Stat proteins from their nonphosphorylated 
counterparts. Although these methods may be properly applied to all Stat proterns, and 
25 their corresponding truncated proteins, rn preferred embodiments the Stat protem has an 
a.^i.0 acid seauence of SEQ ID N0:1 or SEQ ID N0:2, and the truncate! Stat protem 
has an am,n„ ^cid sequence substant.ally sim.lar to SEQ ID N0.3. .n more pre.erred 
embodiments ,he Sta, pro.ein or .he .^nca.ed S.a. pro.ein also has a conver.ed cys.e.ne. 
,„ the most preferred embod.men.. the Stat protein or truncated Stat prote.n has three 
30 converted cysteines which are alkylated cysreines at Cyste.ne 155, Cysteine 440, and 
Cysteine 492 of the Static amino acid sequence shown m SEQ ID NO: 1. 

,n one embodiment a m.xmre con.aimng phosphorylated S.a. pro.e.n and 
nonphosphorylared S,a, prcein are placed o„.o a heparin-solid suppor.. In preferred 
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embodiments the heparm sohd support . e.ther heparm agarose, heparin SEPHADEX or 
heparin cellulose. In the most preferred embodiment the heparin-solid support ,s hepann 



agarose. 



5 In one variation of this embodiment the heparm agarose is washed first with a low-salt 
buffer to remove materials that either bind more weakly than the nonphosphorylated Stat 
protein or do not bind at all. The Stat proteins are eluted from the heparin agarose as a 
function of salt concentration with the nonphosphorylated Stat protein eluting at a lower 
salt concentration than the phosphorylated protein. In one particular embodiment of this 

10 type the protein is eluted with a salt gradient. In a preferred embodiment, the elation of 
the heparin agarose is performed stepwise with an approximately 0.15 M monovalent salt 
elation step, followed by an approximately 0.4 M monovalent salt elation step. In this 
case the anphosphorylated Stat protein elates daring the first elation step, and the 
phosphorylated Stat protein elates daring the second elation step. In a more preferred 

15 embodiment the monovalent salt is potassiam chloride. 

This procedare mav be performed by a batchwise method, thoagh in preferred 
embodiments the heparin agarose is placed in a colamn. The procedare may be 
performed by simple controlled pumping of the colamn, or by HPLC, FPLC and any 
20 other analogoas methodology; or the colamn may be allowed to fiow by the pressare of 

gravity. 

The present invention also inclades methods of preparing a parified alkylated Stat protein 
and methods of preparing a parified alkylated trancated Stat protein. Althoagh these 
25 methods may be properly applied to all Stat and trancated Stat proteins, in preferred 
embodiments the Stat protein has an amino acid seqaence of SEQ ID N0:1 or SEQ ID 
NO-2 and the trancated Stat protein has an amino acid seqaence sabstantially similar to 
SEQ ID NO-3 In one such embodiment an expression vector containing a nacleic acid 
that encodes a Stat protein is placed into a compatible host cell, and the Stat protein is 
30 expressed. The compatible host cell is grown, harvested and then the expressed Stat 
protein is released from the host cell. In a preferred embodiment the expressed Stat 
protein is released from the host cell by lysmg the cells. The Stat protein is then treated 
with an alkylating agent to alkylate one or more cysteines involved in intersabanit 
aggregation. The alkylated Stat protein is then isolated, yielding a parified alkylated Stat 



protein. 



,„ a„o*er sucK e..o<.,.=n,. .h. »pre.i„„ veaor .o„,a,„s a nuCeic aud .ha, encode a 

H Ilna, sequence .... . .u.s,3„,ian, sl.Ha. ,o ..e N..e...nus .he co..espo„d.„s 

I I, S.a, prcein foU„»,„s .he c>eava.e of .he p.o.eo,y..c se„.,.ive N-.e™.„a, do™a,„ 

c„.Ip„„d.„s S,a. p,o,e,„. The ca.ho.„ .e...hus of .he .™hca.ed S.a, pro.e.n 
ex.ends a. ,eas. .o .he phospho,y,a.ah>e .y.os.he ,e,u.ed for homod,.er,za.,o„^ In 
preferred e.bod,h,en.s, a,ky,a.io„ i. performed h, .ncuha.ing .he S.a. pro.e.n «uh 
H e.hyf h,a,e..ide. ,„ more preferred e.hod,.e„.s, ahou. 40 .o 50 o p.r.fi^ 
:;:ed .ru„ca.ed S.a. pro.e.n can he oh.ai„ed fro. . U.ers ^f s.ar.ih. Cure. Th 
„eLds can also Include a s.ep of phosphory, a.ing .he S.a. pro.e,n e,.her pnor .o or 
preferaWy foUow.ng a..y,a.ion. >n preferred .e.hods of .h. .ype, preparar.ons of 
EGF-recep.or kinase are used in the ,« vi.r. phosphoryla.,ng s.ep. 

' The presen. inve„.,on aiso inCudes .e.h„ds of prepar,n, a purified suhs.i.u.ed S.a. prorein 
™e.h„ds Of preparrn. a purified suhs.,.u.ed .ru„ca.ed S.a. prcein^ ^ ^u.h 
,hese n,e.hods n.ay he properiy appiied .0 ah S.a. proreins .nclud.ng — S- 
proreins in preferred enrhodimen. .he S.a. prorein has an anuno acd sequence 
„ 0 SEQ .D .0.2. and .he .nrnca.ed S.a. pro.e.n has an a.ino acd sequence 

,„ SEO ID NO 3. In one such embodimen.. an expression .ecor 

.cid suhs.i.u.ed for a cys.eine of .he S.a. pro.e.„. .herehy rep.acn, ,.. In on preferred 
embodimen., .he amino acd is a polar neurra, am.no acid. In a var,a.,on of ,h,s 
, : odimen. .he a,.erna.,ve polar neu.ral ammo acid is a ,ycine. In a„o, er — of 
,h,s emhod,m=n., .he al.erna.ive Polar neu.ral amino acd is a ser.ne. In sr.U ano.her 
:„ia.ion of .h.s embodimen.. .he al.erna.ive polar neu.ral am.no acd ,s a .hreon.ne. In 
preferred embodimems, .he cys.eine .ha. has been replaced was involved m ,he 
inrersubuni. aggregarion .ha, takes place be.ween S.a. prorems 

The expression vecor is ,hen placed in,„ a companble hos, cell, and .he subs...u,ed S.a. 
in i expressed. The compar.ble hos. cell is grown, harves.ed and .hen .he expressed 
bs.,.u.ed L pro.cn ,s released from ,he hos. cell. In a P''-^ — = 
expressed S.a. pro.ein is released from .he hos. cell by lysing .he cells. The subs.uu.ed 
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S„. pro,ei„ IS ,h,„ iso,a.ed, y.eldi.g a purified subs.Uu,ed S.a. pro.ein, 

,„ „„e =..„d,.e„., expression vecor co„,a.„s a nuCeic acid .ha, encodes a 
s„.s.i.u,ed c™„ca,ed S.a, p.o.ei... m one such e.hodi.e„., an express.on ec.o con a. 

c,e,c acd .ha. encodes a suhs.,.u.ed .^nca.ed S.a. p,o.ein .ha. has an a,,e.„a..v p a. 
nla, a.,no acd suhs.i.u.ed a c,s.e,„e . S.a, p.o.e.n, ------ ^j",^ 

one varia..on of .his embod.nnen.. ,he a„e,na,ive po.ar neu„a, a„,no acd ,s a siycne- ,n 
a , va.ia.ion of .his e.hod,.e„., ,he ahe,n..ive poia. neu,.a, a.ino acd ,s a se.,ne. 

a„o,he. va,ia,ion of ,his e^hodi.en,. .he aherna.i.e poia. neu.ral anr.no acd 
. r ine In a preferred enrbodinren,, ,he cysreine .ha. has been replaced was .nvoived rn 

: rsubuni, a..e.a..on ,ha. ..es piace bcween S.a, pro,,=,ns J-J— 
,„„ca,ed S.a, pro,ein has an arh.no acd s.,uence which is essen„a.„ .he sa.e as he 
p™ , as.-resis.an. domain of ,he S... prorein^ .n preferred e.bodin,en.s, abou. 0 ,o 50 
: pu ,r.ed subs.,.u,ed ,r„nca,ed S,.. pro.ein can be obrained fronr 6 fi.ers of s..r„„. 
: : e These .e.hods can aiso i^fude a s.ep of phosphor„..ins .he S.a. pro.e.n or 
:rca.ed S,a, pro.ein. .„ a preferred n,e.hods of .h.s .pe. an HOP-recep.or .nase 
prepararion is used in ,he in vuro phosphoryla.ing s.ep. 

,„ some en,bod,..n.s. a suhs.i.u.ed S.a, pro.e,„ or a subs.i.„.ed '-^^^^^^ ^ 
, a,so a,i.„a,ed. In such cases an expression vecor conraining a nudcc acd ,ha, encc^ 
:.,.u:ed S.a. prorein or a subs.,.u,ed .runca.ed S.a. pro.ein is pfaced in.o^a cor^p..,b,e 
hos. ce.,, and expressed. In one e.bod..en, .he s„hs.i.u,ed S.a, P'-'" 
repiacenren, amrno acd ,ha, is an .,.erna.,ve polar neu,ra, « acd. In a re e^ 
cldin,e„. ,he a,.er„a.,ve poiar neu.ra, »rino acid is a gfycine. — 
.5 The compatible hos. cell is g.own, harvested and .hen .he expressed subs„.u,ed S.a. 
l.„ or suhs,.,u.cd .™nca.ed S.a. pro.ein is released from .he hos, cell as descr, e. 
ere,n The subs,i,u.ed S.a. pro.ein or subs.nu.ed .runca.ed S.a. pro,cn ,s .h n , ea.ed 
wi.h an al.yla.ing agen. ,o alUyla.e one or more cys.eines .nvolved in in.ersubun 
,ggrega.ion. The al.yla.ed subs.i.u.ed S.a. pro.ein or alk,la.ed subs.,.u,ed .runca.e S a. 
30 p .eil is .hen isola,ed, „eldi„s a pur.Hed al.,la.ed subs,..u.ed S.a. prore n or pur, f.ed 

A nrntPin In Dreferrcd embodiments, alkylation is 
alleviated substituted truncated Stat protein, in preicucu 

Xmed b, incuba,ing .he S,a, prore.n or ,ru„ca.ed S.a, pro.ein wi,h N-e.hyl ma e,m.de. 
I more preferred emh„d,men,s ahou, 40 ,o 50 mg of purged al.,l..ed suhs,..„,ed 
.n,„ca,ed S,a. pro.ein can be obrained from 6 l..ers of s.aning cuUure 
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presen. ,n«nUon also inc.^des n,e,.ods o.' i.e„>.,ying drugs ... effec, m= in,e„c.ion 

• . -^^ p^M A hindinc Site, inis assay is ua^^^^ 

..... —-^^^^^^^^^ 

..„,e. ,0 a pa,r of dose, ..,ne. DNA .....g sU.s, ..nd. , su s s 

::r at:: r — . — .„ .... a. s..... 

, as . p.osp.c,W= - ^^a. ...... a„. ...e, .„ a ...H 

.hroughpu. assay based on .he me.hods described below. . 

Tel. o a prospecve d™, on ..e a,f.„l,y o, ,he S,a. pro.ein-DNA bn,d,n. , 

. „L prospecive drug decreases ,he b.nd.ng aff,ni.y of .he S.a. pro.e.n .0 a 

25 concluded ^. ^ ^^^^^^^^^ embodiment, the 

Stai proteins bound to aajaceni u.>. ^ . . cpo ID 

.runled S.a. pro.e.n bas an anrlno acd sequence .ha, Is subs,a„.,aU, s,„„.ar .o SEQ 



NO:3. 



,y. A of identifvina a drug that enhances the interaction of 

proteins IS determmed. 1 he eiieci ui a h p ff;„-,tv nf the 

. ■ H If the orospectivedrua increases the bmdmgaffm.t> of the 
DNA binding is determmed. If the prospective „ 
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Stat protein to a DNA binding site, it becomes a candidate drug. The binding affinity of 
the corresponding truncated Stat protein to that DNA binding site is also determined. The 
effect of a candidate drug on the affinity of the truncated Stat protein-DNA binding is 
determined. If the candidate drug has no effect on the truncated Stat protein-DNA 
5 binding then it can be concluded that the candidate drug enhances the interaction of 

N-terminal domains of Stat proteins bound to adjacent DNA binding sites. In a preferred 
embodiment, the truncated Stat protein has an amino acid sequence that is substantially 
similar to SEQ ID NO:3. 
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In another embodiment, a drug library is screened by assaying the binding activity of the 
two N-terminal fragments of the present invention. As disclosed in the present invention, 
the N-terminal fragments of Stat proteins form stable dimers in solution. These dimers 
could mimic the role the N-terminal domain plays in the native Stat protein. Therefore, a 
prospective dru. capable of disrupting or enhancing the stability of the dimer formed 
between two N-terminal fragments becomes a candidate for a drug capable of destabilizing 
or stabilizing respectively, N-terminal domain-dependent Stat-DNA binding. These 
candidate drugs then can be tested in an in vitro or in vivo assay with Stat proteins. For 
example, dimerization of the N-terminal fragments in solution can be determined using 
techniques such as fluorescence depolarization. 
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In yet another embodiment, an N-terminal fragment of a Stat protein is attached to a solid 
support. The solid support is washed to remove unreacted species. A solution of free N- 
terminal fragments is poured onto the solid support and the N-terminal fragments are 
allowed to form dimers with their bound counterparts. In one variation, the solid support 
25 is washed again to remove N-terminal fragments that do not bind. Prospective drugs can 
be screened for their ability to disrupt the dimers. or the formation of the dimers. and 
thereby increase the concentration of free N-terminal fragments. In a variation of this 
embodiment, prospective drugs may be screened that enhance the binding of the free N- 
terminal fragments with their bound counterparts. In this case, there is a corresponding 
30 decrease in the concentration free N-terminal fragments. In either case, the measurement 
of an equilibrium constant, or a dissociation rate constant or an off-rate, may be used to 
express the effect of the prospective drug on the N-terminal fragment dimer binding. In 
another variation of this embodiment, prospective drugs that modulate the interaction of 
the N-terminal domain can be screened by determining the amount of N-terminal fragmem 
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that remams bound in the presence of the prospective drug. As compared to the an.ount 
of bound fragment ,n the absence of a prospective drug, prospect.ve drugs that disrupt the 
interaction result m lower levels of bound fragments, whereas prospective drugs whtch 
enhance the interaction result ,n higher levels of bound fragment. One method of 
inonttortng such tnteracttons is through the use of free N-terminal fragments whtch have 
been labeled Some suitable labels are exemplified below. Alternat.vely, the d.mer.zation 
of the free N-terminal fragments with the bound N-termtnal fragments can be momtored 
by changes in surface plasmon resonance. In preferred embodiments the N-termtnal 
fragment has an ammo acid sequence substantially similar to SEQ ID NO:4. 
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In yet another embodiment, the affect of a prospective drug (a test compound) on 
interactions between N-terminal domains of STATs .s assayed in living cells that contain 
or can be induced to contain activated STAT proteins, i.e., STAT protetn d.mers. Cells 
containing a reporter gene, such as the heterologous gene for luciferase, green fluorescent 
15 protein, chloramphenicol acetyl transferase or B-galactosidase, operably l.nked to a 

promoter comprising two weak STAT binding sites are contacted with a prospective dt^g 
. the presence of a cytokine which activates the STAT(s) of interest. The amount (and/or 
activity) of reporter produced in the absence and presence of prospective drug is 
determined and compared. Prospective drugs which reduce the amount (and/or activity) 
20 of reporter produced are candidate antagonists of the N-termtnal interaction, whereas 
prospective drugs which increase the amount (and/or activity) of reporter produced are 
candidate agonists. Cells containing a reporter gene operably l.nked to a promoter 
comprising strong STAT binding sites are then contacted with these candidate drugs m 
the presence of a cytokine which activates the STAT(s) of interest. The amount (and/or 
25 activity) of reporter produced in the presence and absence of candidate drugs is 

determined and compared. Drugs which disrupt interactions between the N-termina 
domains of the STATs will not reduce reporter activity in this second step. Similarly 
candidate drugs which enhance interactions between N-terminal domains of STATs will 
not increase reporter activity in this second step. 

30 

,„ an analogous e^bodimem, .wo reporter gene, each operably under .he eon.ro, of one 
ot .he .wo ,ypes promo.ers described above can be comprised in a single bos, eel, as ong 
as ,he expression of .he ,wo reporrer gene produc.s can be dis.inguished. For example, 
differen. m«,if,ed forms of green flnorescen. pro.ein can be used as described ,n U.S. 
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P«=„, 5.625,048. issued Apri, 29. 1997, hc,*y .ncorpora.ed by reference in i.s en.irery. 

A„„g„„lsrs ot ,he STAT N.emrina, rnreracon world be expecred ,„ arrtagor.ize aspecrs 
of STAT hrncon. Sue. cand.d.re drugs are expecred ro be useful for rhe .rea.nren, of a 
variery of disease srares, including ba, no, limiied ro, infl.mnrarion, allergy, as.luna, and 
leukenrias. Candidate drugs which srabilize ,he N-rcrnrinal inreracion would be expec.ed 
,0 enhance STAT funcrion, and may rherefore have ur.liry in rhe rrearmem of anemras, 
neurropenias, rhronrbocyropenia, cancer, obeslry, .iral drseases and growrh rerardarron, or 
other diseases characerized by a insufficiem STAT ac.iyrty. 

These and orher aspecrs of rhe presen. invenrion wil, be ber.er apprecared by reference to 
the following drawings and Detailed Description. 

pp.cc r,i;^riilPTION T"" nuAWINGS 

' Figure ,A, Polyacrylamide gel elec.rophore.ic analysis of .he purified nonphosphorylared 
prLns. Ali,uo,s of S.a.l« ,l.ne 2; 2.g,, Srat.^ (lane 3; 2.g), and Stat.tc .lane 4; .g, 

„ , 7? SDS-PAGE gel and stained with Coomassie blue. Molecular weigh. 

were run on a l h bu^-r/wjx:, gt-i « 

standards were run in lane 1. M, is given as the kDa on .he left. 

Figure IB Pro.eolysis of human S.a.l». 40.g of purified S.a.lo were d.ges.ed wi.h 
various an,oun.s of sub.ilisin (lanes 4-6) or proreinase K (lanes 9-11, for 30 m.n on rce (as 
described in Marerials and Methods, The ra.,os (w./w., of pro.ease .o pro.e,n were 

,8 (lanes 4 and 9), 1:80 (lanes 5 and .0), and 1:800 (lanes 6 and U). A,i,u„.s of the 
25 reactions were resolved on a 16.5% SDS-polyacrylamide ge, followed by Coornass.e 

u in vn-,- Wnes and 7 untreated Statla; lane 

staining. Lane 1, molecular .eight standards m kDa, lanes " ^"'^ 

3, subtiUsin (15.g); lane 8. prote.nase K (I5.g). Stable fragments of 65 .Da and 16 .Da 

(see text) are marked with arrows. 

30 Figure 2A. Phosphorylai.on of S.a.l» wi.h EGF-recep.or kinase « v,Vr„. 2.g of S.a. 
pro,ein was incuba.ed wt.h EGF-recepror and l.C, of »P .ATP for 6 h a. 4 C. The 
Lacion (20.1 volume, was s.opped by .he addi.ion of '^^^ J";, 

7* SDS-PAGE. which was subse,ue„.ly dried and exposed .o an X-ray film. The .yp.cal 
double, pattenn for phosphorylaied S,at, (Shua, « al. 1992) ,s seen ,n the Coomass.e 
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stained gel in lane 2. Lane 3 .hows .he corresponding au.orad.ogran,^ Only .he slower 
: .and conrarns O deno.es .he pos.non of .he pHosphorvla.ed BCP-recepror, 

pa!, and siow nngranng Sea. pro.e,ns are po.n.ed on, wi,h lines. Lane I conrarns .he 
„,oiec„,ar we.gh, ™ar.ers and .he.r respecnve .olecular w.ighrs are denced ,n 

' of .i.ro phosphorylared Srarlr. A ■o.al of 25 of pro.ein was 

hepar,n agarose coiun. after an . v„™ phosphory,a,io„ reacon and removal 
of ECF-recep.or (see Marerials and Merhods). Depicred is rhe column profile of UV- 
ot bUl- recepioi ISOmM KCl.and 400mM 

absorprlve material elured wi,h ,r,ccess,ve sreps of 50mM KCI, '^O"* 
0 KCI Five microlners of .he indica.ed frac.ons (23 ml, were resolved b, 7% SDS PAGE 
Z stained w.th Coomassie blue (lower .nsert, or blotted on a nitrocellulose membrane 
with an an.i.phosphot,rosine.antibod, (L,300 diluted PV 20 (UBI). upper 
insert). Molecular weights are denoted in kDa. 

,5 Figure ^ Tyros.ne 70, is phosphorylated by EGF-recep.or. The endopro.etnase AspN 
min, were carried out on ai^.^^ 3ta.« in either the unphosphorylated form 
phosph. upper ham or the chromatograph.call, purified phosphorylated form (. 
phosph lower half). The relevant proteins of the matri.-assisted laser 

snectmm are shown Accurate molecular mass determmat.ons 
desorption/iomzalion mass spectrum are snown 

20 allowed for une,uivocal idenrifiearion of the peptide fragments. Peaks are labeled 
according to the amino acid sequence of the corresponding peptides. 

Fioure 3A DNA binding of purified phosphorylated S.a.l» (lane 1, and Statltc (lane 2) 
usLg as a probe the radioactiveiy labelled c^WT sequence. B.ndtng — 
25 equimolar a™oun.s of .he respective proteins. The posit.on of mtgratton o, the tree DNA 
p obe (free, and the protei.ONA comple. (bound, is indica.ed. Note the prese. of a 
slower migrating band only w„h the full length Sta.K.. lane 1 (see also F.gure 3B,. 

Figure 3B. InHuence of cysteine al.yla,ion on the DNA binding activity of S.a.lc. A 
30 m,..ure of phosphorylated and unphosphorylated pro.e.n (0.23 .M final - 15% 
phosphoprotetn) was reacted in .he presence of 0.8 n^ DTT and the .nd.cated 
oncentranons of N-ethyl-maie.mide (NBM, for 20 min a, room temperamre, aoume 
Of p 5 .i. The reaction was stopped with DTT (final concentrafion of 10 -^» f° '"^ 
b, the a d.t,on of 1 .5 pmoies of labelled probe (cib.M67>. Samples were resolved on 
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4.5% native polyacrylam.de gel. (M) denotes the position of bromophenol blue (lower) 
and xylene cyanol (upper) markers. 

'plS^^'^Tiu^on 'o7"P labelled cfosWT oligonucleotide with phosphorylated Statltc 
' and full length Static. A fixed amount of -P labelled cfosWT oligonucleotide (5.6 x lO '" 
M) was incubated with Statl proteins in a 12.5 ^1 volume as described in Materials and 
Methods. Numbers above the lanes indicate the concentrations of dimenc Static and 
Statltc in each reaction. Protein-bound (bound) and free (free) DNA is identified. The 
concentration of free protein dimers at half saturation was determined to be approximately 
1 nM in both cases which corresponds to the apparent equilibrium constant K.,. In the 
lanes marked above "DNA only" no Stat protein was included_in the reaction. 

iliSfmltSof phosphorylated truncated Statl protein with "P labelled 
olLnucleotides containing a "low" (Ly^E, left panel) or "high" (SI, right panel) 
affinity binding site. The DNA concentration was fixed at 2.6 x 10- M and titrated in a 
12.5 Ml volume against a standard protein dilution series ranging from 5 x 10" M to 2.6 x 
10- M dimer final. Protein concentrations for the dimeric protein are given above each 
lane The products were resolved on a native 4.5% polyacrylamide gel and quantified as 
described in experimental procedures. (Bound) protein/DNA complex; (free) free DNA. 
) There was no Statltc included in reactions run on lanes denoted "only DNA". The dimer 
concentration at half saturation was determined from this autoradiograph to be 
approximately 1 x lO" M for both DNA sequences. 

i^l^^S^le'SoiSx of Statla with cfosWT DNA is less stable than the complex 
5 ^ith cfosM67 DNA. Results are shown for experiments designed to determine the off-rate 
.n which 0.55 X 10-^ M dimer was prebound with the radiolabeled DNA fragments (at 2 x 
10- M) containing the cfosWT (0 min; left panel) or cfosM67 (0 mm; right panel) 
sequences. Excess unlabelled DNA (100 x molar excess) was added to the reaction at time 
zero and aliquots were taken at the indicated intervals and loaded onto a rumiing gel to 
50 visualize the amount of complex remaining. The half lifo of the Statla/cfosWT complex is 
less than 0.5 min and that for the Statltc/cfosM67 complex in this titration is about 3 mm. 
Because the electrophoresis was continuous during the experiment the DNA fragments 
(free) and the complexes (bound) are located progressively higher on the gel with 
increasing time, because the later samples were electrophoresed for shorter periods of time 
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than were the earlier ones. 



P„„e comparison of ,.e .ssoca.on ,a.es o, co.p,=«, co„,.n.n, DNA ..,™n,s 
Figure OA P 5,^^,^ s.atuc 

DNA a, ..e point .eac.on wa. Un.. '"^^^^^-" -^^^Zo .wo 

;„a,ca.ed before a,„u„. .ere ioade. on a running p„„ac,ylan„. e,. A n.e e„ 
aif,eren,iyn,isra,ing complexes are .isi..e,deno.ed -(2 x(0,.eO, and D..er . 

0 Unbound (free) DNA runs at the bottom of the gel. 

H.nre .B. ,de„t,ncat,o„ of tbe a.,no temtina, >M amino '^^ ^^^^^ 

(Oimer,) stabilization on DNA. Contparison of stab.bty of StatfS = ^ 
' , - 4, on DNA fragtnents containing two consecutive b.nd.ng snes P.. c^WT, 
,5 iObp apart). The experimental protocol was the same as in Figure 6A. 

F,g„re 7A. mnuenc. of promotor orientation on pro,e.„,DNA '^'TJ^T: 
. bllity. 1.6, X 10- M S,atl„ d.mer were et,uillbrated .-•™»- ^ 
wUh two consecutive binding sites (2x cfeWT) lObp apart m parallel lanes 1 4, 
,0 Itiparallel Oanes 3 - S) orientation. The preformed complexes were chased w,th 
unlabelled competitor DNA as described in the legend to F.gure 6A. 

Figure 7B. Static b.nding to DNA fragments with two parallel bmdmg sites Px c^WT, 
l;!:ed lObp .lanes M,, 3bp (lanes .8,, or ,3bp <lanes ,-12,. The chase expenmen. 
25 performed as described in the legend to Figure 6A. 

DEIiOLEDJJEailEnaNi^^ 

The present mvemion includes methods for producing milligram quantities of d.fferent 
30 Xld Sta. prorerns from recomb.nant DNA co.tmcts. One .ey a c of he 

present invenrron is the rsolation of purtfred phosphorylated Stat protems. Anothe Rey 

of the methods of rhe present invennon comprises rhe modincation of specific 
cysteme residues of the Stat proteins that prevent aggregation. In one preferred 
rldiment, the modification of the cysteine residues is perfon^ed by alKylat.on. 
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The present invention also tnclt^des a stable, soluble truncated Stat protein that reta.ns 
most of the functional activtt.es of the corresponding nat.ve Stat protem. Smce a 
significant portton of the recombinant truncated Stat protein does not form tnc.uston 
bodies and therefore can be isolated in large quanttt.es (40-50 n.g of purified alkylated 
truncated Statl protein can be obta.ned from 6 Hters of start.ng culture,) tt is an excellent 
source of protein for the crittcal .arc studtes necessary to understand and later, control 
the signal transducing properties of Stat proteins. Nucleic acids that encode for a 
truncated Stat protetn are also a part of the present invention. The present mvention also 
mcludes methods of usmg the truncated Stat protems for identtfying drugs that specrfically 
effect the tnteraction of N-terminal domains of Stat proteins that are bound to adjacent 
DNA binding sites. 

The presen. .nvemion includes the idemif.cation and isola„o„ of an N-«rminal fragmen, 
comprised of a compac, don,a,n in .he am.no terminus of S,a,l„. Tl„s compact domain 
enhances the DNA binding of the Sta, protein due ,0 it. abUity to interact w.th a 
neighboring Siat protein. Methods of using this N-termina, fragment to idemify specftc 
drugs that act to ether prevent or enhance the DNA b.nd.ng of Stat prote.ns through 

interfering with or promoting the tnter-ptotein interaction of the N-term,n.l domatn of Sta, 

proteins are also included. 

The present tnventton also includes methods of phosphorylating, .n .Uro. the tyrosine 
residue of Stat protetns, Icnown in .ivo to cause the dimer.zatton of the Stat protem upon 
being phosphorylated. In one preferred embodtment, actuated EGF-receptor parttal y 
purtfted from membranes by Immunoprec.pttation rs used to catalyze thts phosphorylatton. 

addi-on tb. present tnvention includes methods of separating a phosphorylated Stat 
protem from .ts corresponding nonphosphorylated form. Heretofore such separation 
could not be ach.eved due to the unusual behavtor of Stat proteins on gel filtrat.on 
columns. 
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Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 



As used herein a 



•converted cysteine" implies that a cysteine residue of a Stat protein 



or 
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,„„ca«. S,a, prcin of .h. presen, .nvmlon has eUher b«„ edified or replaced by .„ 
auerna,ive na.ura,ly oceurr.ng o, sy„.b«ic a.i„o acd, A converted cys.em. can be of 
,he form of a modified cysteine sucb as a cysteine having tts thtol group blocked, an 
analogue of cysteine such as homocysteine, or an amino acid replacement for cysteine 
such as a alycin. or a serine. Mod.fication of the modified cysteines can be accomphshed 
through al^ylation (i ... forming an alkylated cysteine,, or by metcutat.on. or through 
disulfide bond formation. 

AS used herein a the term -Sta, protein" includes a particular family of transcription factor 
conststmg of the Signal Transducers and Activators of Transcription P-'- 
proteins have been defined in Interna.tona, Patent PubUcation No.s WO 93/19, ( 0 
September 1993. by Jantes E. Darnell. Jr. « WO 95/08629 <30 March ,995. by 
James E. Darnel,. Jr. « an and United States application hav.ng a Serial Number 
08/2,2 184. filed on March U, ,994. entitled. -Interferon Associated Receptor 
Kecognitton Factors. Nucleic Acids Encoding the Same and Methods of Use Thereof by 
James E Darnell. Jr. « ./-. all of which are incorporated by reference ,n their enttrettes. 
herein. Currently, there are seven mammalian Stat family members which have been 
,de„,if,ed. numbered Stat 1. 2. 3. 4. 5A. 5B. and 6. Stat proteins include protems der.ved 
from alternative splice sites such as Human Stat,„ and StatW. .... Static is a shorter 
„ protetn than Static and is translated from an alternatively sphced mRNA. Mod.f.ed Stat 
proteins and functional fragments of Stat proteins are included in the present .nvent.on. 
One fitnctional fragntent is a truncated Stat protein defined below. 

AS used herein a the tem, "truncated Stat protetn" denotes a Stat protein fragment having 
,5 an N-emtinal amino acd sequence that is substantially similar to the N-.entt.nus of the 
■ corresponding htll-leng.h Stat protein following the cleavage of the ptoteolytic sens.t.ve N- 
temtlnal dom'ain from the corresponding ^,1-length Stat protein. The carboKyl term.nus 
of the truncated Sta, protein extends a. least to the phosphorylatable tyrosine requtred for 
homodimertzafon. Truncated Sta, proteins are soluble proteins that can be 
30 phosphorylated. dimerize and bind to the DNA bindtng s.tes of the .ll-length Sta, pr«.n. 
An exantple of a truncated Stat ptotein is S.atUc having the amino acid sccuence of SEQ 



ID NO.j. 



AS used herein the terms "phosphorylated" and "nonphosphorylated" as used in 
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pKospHo.,.,ea. .e, for. .co- .e,e,oa>™nc ^ 

,^ tho «;Rr homoloev domain (SHi) ot tne ouici. u> 
nhosphotyrosine of one partner binds to the SRC homology 

™.L - fo™e. a,™. ,Ke„ .a.>oca.es f.o. ..e c„op.a™ ,o 

,He nucleus, b.nds to a paU„drom,c OAS sequence, .hereby ,c>iva„„8 ua„scr,p.,o„ 

,„ a spcciHc e.bo.,.e„,, »„ an„no aci. sequences ot .be ..nca.e. S,a. P-'" - 
■.s„bs.an.aUy hon,o,ogou,- or ■subs.an.ia,,, simrla." when a. leas. abou. 75% ,pre erably 
a. ,eas. abou 90%. ana n,os. preferably a. ,eas. abou. 95 or 98*. of .he an,.no aCs 
I c over .be ..Hne. ,e„g.h o, .he a.,„o acid sequences; and .he N-,e„.,na, don,a,„ of 
r orrespond., M,-.n,h S.a. p.„.e>n ,s a. ,eas. «.y percen. de,e.ed fron. o.h ^.no 
acid sequences. Anaiogousiy. .wo amino acid sequences of .he S.a. N..er..na> pep .de 
:le n.s are ■subs.an.iaiiy ho-ogous- or .subs.an.,a,.y sinniar" when a. i«s. ^u. 
, ,5; (preferably a. ieas. abou. 90%. and .os. preferably a. ieas. abou. 95 or 8%) of 
amino acds nrarch over .he defined ,eng.h of .he anrino acid sequences; an he 
N..ermina, pepride fragmen. can form homodimers. Sequences ,ha. are subs.a„.,a.ly 
homologous can be iden.if.ed b, comparing ,he sequences using s,andard sofrware 
available in sequence data banks. 

,„ a specific embodimen.. .wo nucleo.ide sequences coding for .he expression of .he 
rru„ca.ed S.a, p.o.e.n of .he presen. inveniion are ■subs.an.iaiiy homologous or 
..subs.an.i.l,y s,milar- when a. leas. abou. 50% (preferably a. leas. *° ' 
preferably a. leas. abou. 90 or 95%) of .he nucleo.ides ma.ch over .he define^ eng.h 
,5 he nuc,eo.ide sequences; and .he coding region for ,he N-.erminal domain of .h 

corresponding fuil-leng.h S.a. prcein is a. leas. fif.y percen. de,e.ed (or f'— 

fl I coding reg,on> from bo.h „uc,eo„de sequences. Sequence. 

homologous can be ,den.ified by comparing .he sequences us.ng s.andard sof.wa,e 

available in sequence data banks. 

• n the Stat prntHn truncated Stal^rotem 

PurifkMmandmiacmz^ 

The S.a< prcein and .runcared S.a. pro.e,n of .he presen. invemion and homologues 
.hereof can be pur.fied as .augh. here.n, using an, number of ai.ernai.ve equ.vaien. 
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p„„d„r« .ha. =.co.pass a «,de va,i«y ot known purificauon steps. Those wnh ski,, in 
L ,rt .ou,a know ,0 refer ro references, such as ,he Me..,ods of En.yn,o,ogy ser.es. for 

greater detail and breadth. 

,„ a specific enrbodimen.. exenrpi.f.ed a suirable procedure for purifying a S,af 

pr„,e,„ ot .he presen, ,nven,ion ,s descr.boi as follows. One sk.lled in .he a„ o, pro,e,n 
purification would know ihar any such general procedure would probably need ,o be 
lodified for any given Sr.. pro.ein and as such, performing .he requisUe modif,ca„o„, 
would nol be considered undue experimeniation. 

Expre,nonmdpurifica,io^ofarecombimnlS,a,promn^ _ 
Nucleic acids comaining sequences codmg tor a S.a, profein are ampl.fied by PCR 
primers comaining resmcion sires in addirion ,o homologous sequence. The pro ucts are 
then cloned using .he restriction si.es into a baculovims transfer vector, e.g. pAcSG-. 
Recombinant vectors are subsequently co-transfec.ed wi.h baculovims DNA, such as 
Baculogold. in.o Sf9 insec, cells. Recombinan. vi^ses can be iden.ified by .mmuncblot 
ex.rac.s ot .he inteced ceils. For prorein production Sf9 cells in a suspension culture 
(approximately 10' cells/ml) are inteced wid, recombinant vtruses (mulnphcty ot 
infection: 1:5) and harvested by low speed cenrrifugation approximately two days 
20 following infection. 

The resulting cells, generally in quantities of between 10- - lO', are lysed in ice cold 
extraction butter [approximately 80 mis of a low concentration Mes butter (20.50 mM 
containing, .00 mM KCl, 10 NaF, 0.02% NaN„ 4 n^ EDTA, I mM EOTA 20 
2. mM DTT and Complete™ protease inhibitors (Boehringer MaMtheim), pH adjusted wtth 
sodium hvdroxide .o pH 7 0, wi.h a dounce homogenizer. All subsequen. steps are 
performed a, 4-0 unless noted otherwise. For optimal results all buffers used dur.ng 
protein purTtca.ton are ch.Ued, thoroughly degassed and flushed with N= before use. 

30 The resulfng lysa.es are cleared by low speed centrihtgation. The supernaran, is brought 
,o about pH 6 after the addition of 0.5 vol of a buffer such as 20mM Mes contatmng 
0 02% NaN,. 20 mM DTT. pH adjusted to about 6.0) and the supernatant ,s aga.n 
cemrituged. The clar.r.ed superna.am is loaded onto a canon exchange resm. e.g. . 
SEPHAROSE, in a shor.. fa. column, e.g.. 5 x 5.5 cm. and elu.ed wi.h a l.near sal. 
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.radient (50-300 mM monovalent salt) and pH gradient (pH 6-7). Fractions contam.ng 
scat protem are idem.fied by, e g. immunoblot, then pooled, and the pH of the pooled 
fractions are adjusted to 8.0 with IM Tris. After the addU.on of 0.25 vol of a low 
concentration buffer such as 20 mM Tris-HCl contain.ng 0.02% NaN„ 10 mM DTT, at 
5 about pH 8, the solution is loaded onto an anionic exchange resm. such as Q-Sepharose, 
in a . 5-. a 2 X 9 cm column. The Stat protein is eluted with a linear monovalent salt 
gradient from 100 mM to 300 mM. Eluted Stat protem is precipuated with solid 
ammonium sulfate to 60% saturation. The resulting concentrated Stat protems are 
dissolved in about 10 ml of 50 mM phosphate buffer. pH 7.2, containing 2 mM DTT, 1 
10 mM EDTA, and Complete™ protease inhibitors. The Stat protein is then alkylated. In 
one embodiment, alkylation is performed with N-ethyl-maletrnide which ts added to a ftnal 
concentration of 20 n^. The alkylation reaction mixture is incubated at room 
temperature for 10 min and then placed on ice for another 30 min. The reaction ,s 
stopped by the addition of ^-mercaptoethanol to 50 mM and ammomum sulfate to 0.5 M. 
15 The resulting reaction mixture is then loaded onto a low substituted Phenyl-Sepharose 
column (e.g., 2 x 15 cm) equilibrated in a low concentration buffer such as 20 mM Tris- 
HCl about pH 7.4, containing 2 mM DTT plus 0.5 M ammonium sulfate. The Stat 
proteins are eluted with decreasing ammonium sulfate dissolved in the column 
equilibration buffer. Fractions containing Stat protein are pooled, and then concentrated 
20 to about 10 mg/ml using .g. , a centriprep 50. The concentrated sam.ple is then applied to 
a gel filtration column, such as SUPERDEX 200 (XK 16, Pharmacia) equilibrated m low 
concentration buffer such as 20 mM Hepes-HCl, pH 7.2. contaimng 0.02% NaN3, 2 mM 
DTT and0 3MKCl. Fractions containing the Stat protein are pooled. The pooled 
fractions are then concentrated by ultrafiltration to approximately 20 mg/ml and quick 
25 frozen on dry ice. The purified proteins are stored at -70"C. When purifying substituted 
Stat protein containing converted cysteines, in which the cysteines that are involved in the 
inter-protein aggregation have been replaced, the alkylation step is Uft out. The 
procedure is otherwise analogous. 

30 Expression and purification of a truncated Stat protein . 

A portion of a Stat gene encoding a truncated Stat protein is amplified by PCR with 
primers containing restriction sites in addition to the desired sequence. The products are 
then cloned into a bacterial vector, the pET20b expression vector (Novagen) using 
these restriction sites. Growth and induction of transformed E. col. e.g. , BL21DE3 
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<pLysS) is performed by standard procedures, such as descr.bed by Studier and Mot au. 
1986 (in th>s particular case the induction was carried out tor 4 hours at 30°C wuh 0.. 
n.M ITPG) Generally, about 50% of the induced protein remains soluble. This soluble 
truncated Stat protein is the isolatable form of the recombmant protem. Cells are 
5 collected by centrifugation and resuspended tn ice cold extraction buffer at a concentration 
of about 30 g of cells to 100 mis of a low concentration buffer, e.,. , 20 nvM Hepes/HCl 
pH 7 6 containing 0.1 M KCl, 10% Glycerol, ImM EDTA, 10 mM MnCl. 20 mM 
DTT 100 U/ml DNase I (Boehnnger Mannheim), and Complete™ protease mh.buor. 
Cells are lysed by multiple cycles of freeze/thawing. Lysis is continued at 4''C while 
.0 stirring slowly for about an hour. The lysate is then centnt^ged for about 20 mm at about 
20 000 X g at 4"C. Polyethylemmine (0.1% final; Sigma) is added to the supernatant, the 
solution gently mixed and centrifxiged for about 15 min at about 15,000 x g. All 
subsequent steps are performed m the cold (4"C) unless stated otherwise. 

15 The supernatant contaimng the soluble truncated Stat protem is precipitated -th saturated 
ammonium sulfate solution in two steps (0-35%; 35-55% saturation final). The 35-55 . 
pellet IS redissolved m about 20 ml of 50 mM phosphate buffer, pH 7.2, containing 2 mM 
DTT 1 mM EDTA. and Complete^' protease inhibitors. The truncated Stat protem is 
then alkylated. In one embodiment, alkylation is performed with N-ethyl-maleimide 
.0 which IS added to a final concentration of 20 mM. The alkylation reaction mixture is 

incubated at room temperature for 10 min and then placed on ice for another 30 mm. The 
reaction is stopped by the addition of /3-mercaptoethanol to 50 mM and solid ammonium 
sulfate to 0.9 M. The mixture is then loaded onto a Fast Flow Phenyl-Sepharose column 
(low substituted, 2 X 15 cm) that had been equilibrated in buffer such as 50 mM Tr.s/HCl, 
25 pH 7 4 containing 1 mM EDTA. 0.02% NaN3. 2 mM DTT, plus 0.9M ammonium 

.u.fate After washin. the column, a linear decreasing salt gradient from 0.9 M to 0.05 
M ammonium sulfate in the equilibration buffer, is applied. The truncated Stat protein 
elutes at about 0.5 M salt. The fractions containing truncated Stat protein are pooled and 
dialysed overnight against 2 x 4 liters of a buffer such as 40 mM Mes/NaOH pH 6.5, 
30 contaimng 10% Glycerol. 0.5 mM EDTA, 0.02% NaN,. and 140 mM KCl. This material 
.s loaded onto a cation exchange resm, e.g., S-Sepharose. in a short, fat column, e.,.. . 
X 5 5 cm and a linear 500 ml gradient of a buffer such as 40 mM Mes/NaOH pH 6.5, 
contaimng 10% Glycerol, 0.5 mM EDTA. 0.02% NaN, containing 140 mM to 300 mM 
KCl was applied. The protein generally elutes at approximately 220 mM KCl. Fractions 
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conta,n.ng the truncated Stat protein are collected and d.alysed against 3 Hters of a buffer 
such as 50 mM Tris/HCl, pH 8 conta.n.ng 10% Glycerol, 2mM DTT. and 50 mM KC, 
with one change of buffer. The protein solution ts loaded onto an anion, exchange res.n. 
such as Q-Sepharose. m a column a 2 x 9 cm and bound proteins are eluted wuh a 
5 linear gradient from 50 to 300 mM KCl in a buffer such as 50 mM Tris/HCl, pH 8 

containing 10% Glvcerol, 2mM DTT. Fractions containing the truncated Stat protein are 
combined and precipitated with solid ammonium sulfate to 55% saturation. At this stage 
the 95% pure preparation can be stored at -20"C until subjected to in .uro phosphorylation 
or is directly loaded onto a gel filtration column, such as Superdex 200 (XK 16; 
10 Pharmacia) equilibrated with 10 mM Hepes/HCl, 7.4 containing 100 mM KCi, 2 mM 

DTT and 0 5 mM EDTA. In this case the precipitated protein ,s first dissolved m about 
2 ml of 10 mM Hepes/HCl, 7.4 containing 100 mM KCl, 2 mM DTT, and 0.5 mM 
EDTA and then placed on the gel filtration columr.. The truncated Stat protein elutes tn a 
symmetrical peak and is concentrated to a concentration of about 20 mg/ml ustng a 
15 Centriprep 50, for example, and quick frozen on dry ice. The pure alkylated truncated 
Stat protein is stored at -70°C. Typically yields of 40-50 mg (greater than 98% pure as 
judged by Coomassie blue stain and mass spectroscopy) of truncated Stat protein from 6 
liters of starting culture can be obtained. Any person skilled in the art would know to 
scale-up this procedure when a greater quantity of Stat protein is needed, and to 
20 scale-down the procedure when less purified Stat protein is required. 

When purifying substituted truncated Stat protein containing converted cysteines, m which 
the cysteines that are involved in the inter-protein aggregation have been replaced, the 
alkylation step is left out. The procedure is otherwise analogous. 
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on. kev aspect of the present invention need to be emphasized: the identification of a 
soluble truncated Stat protein that is crucial for preparing large an.ounts (30-.0 mgs) ot 
Stat protein m a single preparation. Heretofore, essentially all of the recombinant Stat 
protein expressed in a bacterial host, accumulated entirely in insoluble inclusion bodies. 
30 The present invention has overcome this problem by producing a truncated protein that is 
soluble in significant quantities. 

Preparauon of EGF-receptor kinase and ,n .aro phosphorylauon of Stat proteins. Human 
carcinoma cells such as A431 cells, are grown to 90% confluency in 150 mm diameter 
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p,„es ,n D„lb.cco-s mCiM Eagle', meO.un, supple^emed wi,h .0% bovine ca,r serum. 
The ce„s are washed once wi.h chii.ed phosphate buffered saline. PBS, and lysa.es are 
,hen eonvenlenrly prepared In about I ,ul of ice cold lysis buffer per plare. such as 1 
Hepesmci. pH 7,3. conrain.ns .50 ^ NaCI. 0,5. TrUon X-.OO. 10. Glycerol. 1 n*l 

5 SaVO, lOmMEDTA and Complere'" protease inhrblrors. After about 10 mtnutes on 
,ce the cells are scraped, vortexed and dounce homogenized. The lysates are cleared by 
centrifugatlon at 4-C. e.g.. by centrifuging for 20 n,in a, top speed ,n a„ Eppendort 
nricrofuge. The resulting supernatant ,s stored at -TO'C until needed. Immediately before 
use. one volunre of the lysates ,s mixed with four volumes o, the lysis buffer forming 

10 diluted lysate. 

EGF-receptor precipitates are obtained by incubating 5 ml of "diluted lysate with about 50 
,g of an ant-EGF-receptor monoclonal anttbody directed against the extracellular omam. 
After two hours of rotating the sample a, 4.C. 750 ,1 of Proteitr-A-agarose (50% slurry; 
,5 oncogene Science) is added, and the incubat.on proceeds while rotating, for about one 
more hour. Agarose beads containing the EOF-receptor immnnoprecipttates are washed 
exhaustively (5-10 times) with lysis buffer and then at least twice tuore with a s^ntge 
hutter such as 20 mM Hepes,HC, containing 20% Glycerol, ,00 mM NaCl. and 0., mM 
Na.VO,. Precipttates from 5 ml diluted lysate are dissolved in 0,5 ml of the storage 
20 buffer, flash frozen on dry ice and stored at -70°C, 

(n^edlately before the i. ^ Kinase reaction the Pr„te,n-A-agarose 
from 5 m, dilute lysate is washed once with a Ix kinase buffer such as. 20 n*. Tr.s/HCU 
pH 8 0 containing 50 mM KC. 0,3 mM Na.VO.. 2 mM DTT. pH S O and then dissolved 
25 in 0 4 ml (total volume) of this buffer. Afterwards the washed EGF-receptor precrpttate ts 
ir^ubated on Ice tor about 10 minutes in the presence of a final concentration of mouse 
EGF of 0 15 ng„l- Phosphorylation reacttons ate conventently carrred out in Eppendor, 
tubes in a final volume of 1 ml. To the pre-ineubated kinase preparatton the foUowmg ts 

^ rr., „1 0 1 M DTT 50 u.\ 0.1 M ATP. 4 mg purified Stat 
added: 60 ,.1 lOx kmase buffer, 20 111 0.1 M ui 1. F 

30 protein (e.g, , the Superdex 200 eluate for Stat proteins; and ammontum sulfate pellets 

dissolved ,n 20 mM Tris /HO, pH 8,0 for the truncated Stat protein of the preparattons 

described above), 10 .1 IM MnCl, and d.stilled water is added to 1 ml The reaction ,s 

allowed to proceed fo, about 15 hours at 4C, After 3 hours an addlt.ona, .5 .1 of 0, 1 M 

ATP is added. 
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Separation of pkospkorslated from ur^phosphorylated Stat proteins . The m varo kmase 
reaction mixture (above) .s freed frotn the EGF-receptor bound to agarose beads by 
washing the beads and phys.cally separating the eluate from the beads. Th.s may be 
conven.entlv perfom.ed by sp.m.ing the m.xture through a plug of siliconized glass wool 
5 at the bottom of a pierced Eppendorf tube. The glass wool ts washed wuh 0.5 ml ot a 
buffer such as 20 mM Tr.s/HCl. pH 8.0. contaming ImM EDTA, and 2 mM DTT. Thts 
buffer is also used to equilibrate a heparir, agarose column, (HA-buffer). The pooled 
volumes from the glass wool eluate are loaded onto the equilibrated heparin agarose 
column (15x7 cm) and the columt. is washed with about 50 ml HA-buffer 50 mM 
10 KCl The bound Stat proteins or trur^cated Stat proteins are elated w.th two consecut.ve 
50 ml volumes of HA-buffer p/«. a moderate salt concentration such as 150 mM KCl and 
then HA-buffer plus a htgher salt concentration such as 400 mM KCl. Unphosphorylated 
protems generally elute at the moderate salt concentration and are then concentrated 
by ultrafiltration to about 10 mg/ml, flash frozen on dry ice and stored at -70'>C. 
15 Phosphorylated Stat proteins generally elute at the higher salt concentration and are 

concentrated to about Img/ml. Glycerol .s added to about 50% (vol/vol) and the material 
is stored at -20''C. 

Phosphorylated truncated Stat protein is brought to a concentration of about 15 mg/ml. 
20 The concentrated sample is then applied to a gel filtration column, such as Superdex 200 
(XK 16 Pharmacia) equilibrated in low concentration buffer such as 20mM Hepes-HCl, 
pH 7 2,' containing 0.02% NaN,, 2 mM DTT, and 0.3 M KCl. Fractions containing the 
gel filtered phosphorylated truncated Stat protein are pooled, concentrated to 
approximately 20 mg/ml, flash frozen on dry ice and stored at -70"C. 

Genei^LTedm^^ 



Proteins 
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In accordance w.th the present invention there may be employed conventional molecular 
biology, microbiology, and recombinant DNA techniques withtn the skill of the art. Such 
techniques are explained fully in the literature. See, Sambrook, Fritsch 8. Mamat.s, 
Molecular Clon.n,: A Laboratory Manual. Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook et al., 1989 ): 
Clomns- A Practical Approach, Volumes I and II (D.N. Glover ed. 198.); 
OU^onucleoticie S.ntHes. (M.J. Gait ed. 1984); Nucleic Actd mndization [B.D. Hames 
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& S J Higgins eds. (1985)1; Transcnption And Translation [B.D. Hames & S J. Higg.ns. 
ed. (1984)1 Ammal Cell Culture [R I. Freshney. ed. (1986)1; Imrnomzed Cells And 
En^.rnes [IRL Press. (1986)1; B, Perbal, A Practical Guide To Molecular Cloning (1984); 
F.M. Ausubel et al, (eds.). Current Protocols in Molecular Biology, John Wiley & Sons, 
5 Inc. (1994). 

Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

10 AS used herein, the term "gene" refers to an assembly of nucleotides that encode a 
polypeptide, and includes cDNA and genomic DNA nucleic acids. 

A "vector" is a replicon, such as plasmid, phage or cosmid. to which another DNA 
segment may be attached so as to bring about the replication of the attached segment. A 
15 "replicon" is any genetic element (e.g. , plasmid, chromosome, virus) that functions as an 
autonomous unit of DNA replication in vivo, i.e., capable of replication under its own 

control. 

A "cassette" refers to a segment of DNA that can be inserted into a vector at specific 
.0 restriction sites. The segment of DNA encodes a polypeptide of interest, and the cassette 
and restriction sites are designed to ensure insertion of the cassette in the proper reading 
frame for transcription and translation. 

A cell has been "transfected" by exogenous or heterologous DNA when such DNA has 
25 been introduced inside the cell. A cell has been "transformed" by exogenous or 

heterologous DNA when the transfected DNA effects a phenotypic change. Preferably, 
the transforming DNA should be integrated (covalemly linked) into chromosomal DNA 
making up the genome of the cell. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of ribonucleosides 
(adenosine, guanos.ne, undine or cytidine; "RNA molecules") or deoxy ribonucleosides 
(deoxvadenosine, deoxyguanosine, deoxythymidine, or deoxycytidine; "DNA molecules"), 
or any phosphoester analogues thereof, such as phosphorothioates and thioesters. in either 
single stranded form, or a double-stranded helix. Double stranded DNA-DNA. DNA- 
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RNA and RNA-RNA hei.ces are possible. The cerm nuCeIc acid mc,ec„,e. and in 
pa,.icu,ar DNA or RNA niolecu,., refe. only .0 .he pr.mar, and secondary s.rucure of 
L ..oiecnie. and does no. ,in,i< i. .o any par„c„Ur ,err,ary fo^s. Tir.s. rh,s rennr 
,„c,„des d„„b,c.s,ra„ded DNA found, ..er a,., rn iinea, or crcniar DNA n,o,eco.es 
, resrricnon fragmenis), plasmrds, and c.ronrosomes. In discuss.ng .he s.rucrure o, 
aLiar donHc-srranded DNA nro.ecnies, sequences .ay be described bere.n accord.ng 
1 .he nomra. convenrron of giving oniy .he sequence in <he 5' ro 3' d.rec.on along .he 
„on.ranscribed s.rand of DNA (i-. . .he s.rand having a sequence homologous .0 the 
^NA). A -recomb,n,m DNA molecule" is a DNA molecule .ha. has undergone a 
molecular biological manipulation. 



A nucleic acid molecule ,s -hybridizable" .o an„.her nucle.c acid molecule, such as a 
cDNA, genomic DNA, or RNA, when a single srranded form of .he nuCeic ac, molecule 
can amrea, .o .he o.he, nucleic acd molecule under .he appropriate condi.ions of 
. remperarure and soiu.ion ionic s.rengrh Sambroo. e. al., s„p™,. The cond,..ns o, 
.emperarure and ionic s.reug.h d..ermine .he -s.r.ngency of .he hybrid.a,.on. Tor 
preliminary screening for homologous nucle.c acids, low 
ondiiions, corresponding >o a T„ of 55-, can be used, es- 5. SSC, 0,i% SDS. 0..5. 
conuiuu , f „ , -^v <;<sr 0 5% SDS) Moderate stringency 

milk and no formamide; or 30% formamide, :.x SSC, 0.5% SUb) 

H rn . hiaher T es 40% formamide. with 3x or 6x 
20 hybridization conditions correspond to a higher l„„^.g, . , .0% 

sec High stringency hybridization conditions correspond to the highest T ..... 50. 
formamide. 5x or 6x SCC. Hybridization requires that the two nucleic acids contatn 
complementary sequences, although depending on the stringency of the 1.—, 
mismatches between bases are possible. The appropriate stringency for hybndizin. 
25 nucleic acids depends on the length of the nucleic acids and the degree of 

Iplemen J. variables well known in the art. The greater the 

homology between two nucleotide sequences, the greater the value o. T. tor hybrids ot 

those sequences The relative stability (corresponding to higher T J 
nucleic acids having those sequences. on A -RNA 

of nucleic acd hybridiza.ions decreases .n .he following order: RNA:RNA, DNA,RNA, 
,0 DNA DNA, For hybrids of grea.er .han 100 nucleo.ides ,n ,eng.h, e<,ua.,o„s for 
calculaling T. have been derived Sambroo. e, a,., ,u,ra. 9.50-0 5 1) For 
hybridizarion w„h shorter nucleic acids, i.e., oUgonucleorides, .he pos,.,on of m— 
becomes more imponant, and the leng.h of .be ol.gonucleo.ide determmes ,ts spe ^ 

Sambroo. .. al , su,ra. U.7.,..S,. Preferably a m.nimum length for a hybrrd.zahle 
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„„c,.,c aciJ ,s at leas, abou, 12 nucl.ci.es; preferably a, leas, abou. 18 nucleo.ide. and 
„,„re preferably rhe lengrh ,s a. leas. abou. 27 nucleo.ides; and mosr preferably 36 

nucleotides. 

5 In a specific embodiment, the term "standard hybridization conditions" refers to a T,., of 
55°C, and utilizes conditions as set forth above. In a preferred embodiment, the T,„ >s 
60°C; in a more preferred embodiment, the T,„ is 65°C. 

A DNA -coding sequence- is a double-s.randed DNA secuence .h.ch ,s .ranscribed and 
.0 .ranslared into a polypeptide in a cell in .,ro or v,v„ when placed under Che conrrol o 
appropriare regula.ory sequences. The boundaries of .he coding sequence are de.ernuned 
by a s.ar. codon at .he 5' (amino) terminus and a translation stop codon a. the 3 
(carboxyl) terminus. A cod.ng se<,uence can include, bu. is no. limhed .o. prokaryor.c 
sequences and synrhe.ic DNA sequences. If .he coding sequence is in.ended tor 
,5 expresston in a eulcaryotic cell, a polyadenylation s.gnal and transcr.p.ion .erminar.on 
sequence will usually be loca.ed 3' lo the coding sequence. 

Transcriptional and translattonal comrol sequences are DNA regula.ory sequences such as 
promoters, enhancers, terminators, and the like, that provide for .he cxpress.on of a 
20 coding sequence ,n a host cell. In eulcaryotic cells, polyadenylarion s.gnals are con,rol 

sequences . 

A -promoter sequence" is a DNA regulatory region capable of binding RNA polymerase 
in a cell and ini.ia.ing transcription of a downs.ream (3^ direcion, codmg sequence. For 
25 purposes of defning .he presen. inven.ion, the promoter sequence is bounded at ..s 3 

.erminus bv the rranscrip.ion ini.,a.ion si.e and ex.euds ups.ream <5' direcion) .o .nclude 
the nrtnimum number of bases or elemenrs necessary to tnit.ate .ranscr,p.,o„ a. leve.s 
dcecable above background, Wirhin .he promoter sequence will be found a transcr,p.,on 
initiation sue (conveniently deftned for example, by mapping with nuclease SI), as well as 
30 protem binding domains (consensus sequences) responsible for rhe binding of RNA 
polymerase. 

A coding sequence is "under the control" of transcr.pt.onal and translational control 
sequences in a cell when RNA polymerase transcribes the codmg sequence mto mRNA, 
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which is then trans-RNA spliced and translated .nto the protein encoded by the codmg 

sequence. 

A "s.cnal sequence" .s included at the beginning of the codmg sequence of a protem to be 
^ expressed on the surface of a cell. This sequence encodes a signal pept.de. N-term,nal to 
the mature polypeptide, that directs the host cell to translocate the polypept.de. The term 
"translocat.on signal sequence" is used here.n to refer to th.s sort of signal sequence. 
Translocation s.gnal sequences can be found associated with a var.ety of proteins nat.ve to 
eukaryotes and prokaryotes, and are often funct.onal in both types of organ.sms. 

AS used heretn. the term "homologous" in all its gramnutical_ forms refers to the 
relationship between proteins that possess a "con^on evolutionary origin." including 
proteins from superfamtlies the .mmunoglobuUn superfamtly) and homologous 

proteins from different species (e.g., myosin l.ght chain, etc.) (Reeclc et al., 1987. Cell 
15 50:667). Such proteins have sequence homology as reflected by their high degree of 
sequence similarity. 

Accordingly, the ,ern, "seciuence similarity- in .11 its grammarical tomrs refers .o rhe 
degree of idemny or correspondence between nucleic acid or amino acid sequences of 
20 proteins tha, nray or may no, share a coupon evolutionary origin isee Reeck et al., 
supm) However, in common usage and ,n the instant application, the term 
-homologous,- When modified with an adverb such as -highly," may refer to sequence 
similarity and not a common evolutionary origin. 

25 The term "corresponding to" is used herein to refer similar or homologous sequences, 
whether the exact position is identical or different from the molecule to whtch the 
similarttv or homology is measured. Thus, the term "correspondms to ' re.ers to the 
sequence simtlartty, and no, the numbering of the amino acid residues or nucleofde bases. 

30 A gene encodtng Stat protetn, whether genomtc DNA or cDNA. can be tsoiated from any 
antmal source, particularly from a mammal. Methods for obtaining the S,a, pro,e.. gene 
are well ^ow„ m rhe art, as described above isee. e.,.. Samhroo. e, al., .989, supraY 

A "heterologous nucleotide sequence" as used here.n ,s a nucleotide sequence tha. is added 
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to a nucleotide sequence of the present invention by recombinant methods to form a 
nucleic acd which is not naturally formed in nature. Such nucleic acids can encode 
chuneric and/or fus.on proteins. Thus the heterologous nucleotide sequence can encode 
peptides and/or protems which contain regulatory and/or structural properties. In another 

5 such embodiment the heterologous nucleotide can encode a protem or peptide that 
functions as a means of detecting the protein or peptide encoded by the nucleotide 
sequence of the present invention after the recombinant nucleic acid is expressed. In still 
another such embodiment the heterologous nucleotide can function as a means of detecting 
a nucleotide sequence of the present invention. A heterologous nucleotide sequence can 

10 comprise non-coding sequences including restriction sites, regulatory sices, promoters and 
the like. 

The present invention also relates to cloning vectors containing genes encoding analogs 
and derivatives of the Stat protein, including the truncated Stat protein, of the invention, 
15 that have the same or homologous functional activity as Stat protein, and homologs 

thereof. The production and use of derivatives and analogs related to the Stat protein are 
within the scope of the present invention. 

Stat protein derivatives and analogs as described above can be made by altering encoding 
20 nucleic acid sequences by substitutions, e.g. replacing a cysteine with a threonine, 
additions or deletions that provide for functionally equivalent molecules. Preferably, 
derivatives are made that have enhanced or increased functional activity relative to native 
Stat protein. Alternatively, such derivatives may encode soluble recombinant fragments of 
Stat protein such as Statltc having an amino acid sequence of SEQ ID NO:3. 
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Due to the degeneracy of nucleotide coding sequences, other DNA sequences which 
encode substantially the same an.ino acd sequence as a truncated St,.t protean gene ma> be 
used in the practice of the presem invention. These include but are not limited to allelic 
genes, homologous genes from other species, which are altered by the substitution of 
different codons that encode the same amino acid residue within the sequence, thus 
producing a silent change. Likewise, the truncated Stat protein derivatives of the 
invention include, but are not limited to, those containing, as a primary amino acd 
sequence all or part of the amino acid sequence of a truncated Stat protein including 
altered sequences in which functionally equivalent amino acid residues are substituted for 
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residues within the sequence resulting in a conservative amino acid substitution. For 
example one or more ammo acid residues within the sequence can be substituted by 
another amino acid of a similar polarity, which acts as a functional equivalem, resulting in 
a silent alteration. Substitutes for an ammo acid wthin the sequence may be selected from 
5 other members of the class to which the amino acid belongs. For example, the nonpolar 
(hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan and methionine. Amino acids containing aromatic ring 
structures are phenylalanine, tryptophan, and tyrosine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
10 positively charged (basic) amino acids include arginine, lysine and histidine. The 

negatively charged (acidic) amino acids include aspartic acid a_nd glutamic acid. Such 
alterations will not be e.xpected to affect apparent molecular weight as determined by 
polyacrylamide gel electrophoresis, or isoelectric point. 

15 Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free NH, can be maintained. 

20 

Amino acid substitutions may also be introduced to substimte an amino acid with a 
particularly preferable property. For example, a Cys may be introduced a potential site 
for disulfide bridges with another Cys. A His may be introduced as a particularly 
■•catalytic" site (i.e.. His can act as an acid or base and is the most common amino acid m 
25 biochemical catalysis). Pro may be introduced because of its particularly planar strucmre. 
which induces /3-turns in the protein's structure. 

The genes encoding Stat proteins, truncated Stat protein and derivatives and analogs 
thereof can be produced by various methods known in the art. The manipulations which 
30 result in their production can occur at the gene or protein level. For example, the cloned 
truncated Stat protein gene sequence can be modified by any of numerous strategies 
known in the art (Sambrook et al„ 1989, supra). The sequence can be cleaved at 
appropriate sites with restriction endonuclease(s), followed by further enzymatic 
modification if desired, isolated, and ligated in ntro. In the production of the gene 
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e„cod,„g a d^lvauve or analog or a S.a, p».ei„ or a .u„ca.ea S.ac pro,.in. ere sK d 

ra.„ ,0 en^ure .a, .e r^.. -a.„s .a.e 
..„e a. <h= S,a, pro.ein gene. u„,n,err„p.ed .y ,ra„sU.,o„a, stop s„„a,. ,n ,h= gene 
region where Ihe desired activity is encoded. 

^.a,t,o„a„>.. the S,at or truncated Stat protein-encodtng nuCeic acid sequence can he 
stated v„r» or ,„ v,„, to create and,or destroy translation, initiation. and,or 
termination sequences, or to create variations in coding regions " 
restriction endonuCease sites or destroy preexisting ones, to facilitate mrther , ft™ 
H r ' tion Preterahiy snch mutations enhance the functional activity or isolatahilit, of 
or'natlve Stat protein gene product, ^ny technique for mutagenesis 

mutagenesis (Hutchinson, C. et al., IV /o, 

, , iQRA Gene 44 177 Hutchinson et al., 1986, 

1984, DNA 3:479-488; Oliphant et al., 1986, Gene 44. W / , 

, . H c;.i ilSA 83-710) use of TAB® linkers (Pharmacia), etc. PCK 

H. Erlich. ed.. Stockton Press, Chapter 6, pp. 61-70). 

,0 The identified and Isolated gene can then he inserted iiito an appropriate cloning vector, 
l a numhet of vector-hos. systems .no«n in the art ma, he used. Possihle vec„. 
iriclu e hut are not limited to, plasmids or modified viruses, but the vector system r^^s. 
: compatlhle with the host cell used. Examples of vectors mclude^hut are no limited to. 
CO,, hacteriophages snch as lamhda derivatives, or plasmids such as pBR322 
25 derivatives or pUC plasmid derivatives. e.S- . pGEX vectors, pmal-c, Pf 

lenion into a cloning vector can, for example, he accomplished h, "sa„ng t e 
f„.me„, into a cloning vector which has complemettiary cohesive t„mini Ho. r, 
the^omplementary restriction sites used to fragment the DN. are no, present n the 
clonmg vector, the ends of the DNA molecules may he enzymatically modified. 
30 Alternlively any s.te desired may he produced hy ligattng nucleotide se,uences (1 n.ers 
l : DNA teLnl. these ligated linkets may comprise speclHc chemically syn, e^,.d 

0 gonucleotides encoding restriction endonuclease recogniiion sequences. Pecomh^nt 

1 ecules can he introduced into host cells v,a iransformation. 

elecroporatlon. etc.. so that man, copies of the gene sequence are generated. Preferably. 
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„e c,„™d gen. ,s c„„,ai,«. on a shn,„e v«.„r p,asn,i.. w.'.ch provides for expansion ,n 
a doning ceM, E. coU. an. faOe pnr.ficar.on for .ub.e,uen, Inserrlon ,„ro an 
.ppropriare expression ce„ ,ine. if sue. is des.re. For exa^pie, a s.n,„e vecor, w.c. 

L veero, rhar can repHcare in n,ore .han on. iyp. of orgarrisnr. can .e pr.par d for 
; rep„ear,on in £ coU and Sncc,.ro,.yc. by ,ir*,ng sequences fron, an 

coli plasraid with sequences form the yeas. 2^ plastnid. 

,„ an a„erna.iv. n,e,bod, rhe des.red g.n. nra, be rdenrrfied and rsolared afer rnserrion 
inro a su„ab>e cloning vecror in a "shoe gun" approach. Enrich^.n. for ,h. d»,red gene, 
,0 for exa.p,.. by si., fracionarion, can be don. before rnserrion in.o .he Conrng vecror. 

' Fv prfssion of Stat Proteins 
The nucieoride sequence coding for a S.af prorern, or ^ncriona, fragnrem. including .he 
,„„ca,ed S,ar prorein and ,he N..r.lnal pepfide fragnaenr of a Sra, pror.in, d.r.var.ves or 

expression vecor. i.e.. a vector which contains the necessary elements for the 
„Lr ,p,.o„ and translation of the ,ns.n.d pro.c.n-coding sequence. Such elements are 
termed herem a -promoter." Thus, the nucleic acid encoding a Stat prote.n of t e 
invention or ..ncttonal fragment, tnCudtng the truncated Stat protetn and the N ermtnal 
20 pepttde fragment of a Stat protein, d.r.v.tives or analogs thereof, is operation.,, 

assocated wtth a promoter in an expression vecor of the invention. Both cONA and 
genomic sequences can be cloned and expressed under control of such regulator, 
sequences. An .xpr.ssion v.c,or also preferably includes a replication origin. The 
necessary transcripr.ona, and translationa, signals can be provided on a 
25 expression vector. As detailed below, all genetic manipulations described for the S. at 
.en. in this section, may also b. employed tor genes encoding a functronal fragment, 
mcluding the truncated Stat prorein and the N-temrmal peptide fragnaent of a Stat protem. 
derivatives or analogs thereof, including a chimeric protein, thereof. 

30 potential host-vector systems include but are not limited to mammalian cell systems 
infected with vir.s „ g., vaccrma vt^s, adenovirus, etc.; insect cell systems infected 
With virus ,e.,., baculovirus,; microorganisms such as yeast containing V-'-^^^'y' 
bacteria rransformed with bact.rtophag., DNA, plasmid DNA, or cosmid DNA. The 
expression elements of vectors vary m their strengths and specificities. Depending on the 
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host-vector system utilized, any one of a number of suitable transcription and translation 

elements may be used. 

A recombinant Stat protein of the mvention, may be expressed chromosomally, after 
mtegration of the codmg sequence by recombination. In th.s regard, any ot a number of 
amplitlcat.on systems may be used to achieve h.gh levels of stable gene expression (See 
Sambrook et al., 1989, supra). 

The cell into wh.ch the recombinant vector comprising the nucleic acid encoding Stat 
10 protein is cultured m an appropriate cell culture medium under conditions that provide for 
expression of Stat protein by the cell. 

Any of the methods previously described for the insertion of DNA fragments into a 
cloning vector may be used to construct expression vectors contaming a gene consisting of 
15 appropriate transcriptional/translational control signals and the protein coding sequences. 
These methods may include in vitro recombinant DNA and synthetic techniques and m 
vivo recombination (genetic recombination). 

Expression of Stat protein may be controlled by any promoter/enhancer element known in 
20 the art, but these regulatory elements must be functional in the host selected tor 

expression. Promoters which may be used to control Stat protein gene expression include, 
but are not limited to, the SV40 early promoter region (Benoist and Chambon. 1981, 
Nature 290-304-310), the promoter contained in the 3' long terminal repeat of Rous 
sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), the herpes thymidine kinase 
25 promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the 

regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature 296:39- 
42)- prokaryotic expression vectors such as the ^3-lactamase promoter (X'ilia-Kamaroff. et 
al.. 1978, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer, et 
al 1983 Proc Natl. Acad. Sci. U.S.A. 80:21-25); see also "Useful proteins from 
30 recombinant bacteria" in Scientific American, 1980, 242:74-94; promoter elements from 
yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) 
promoter PGK (phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and 
the animal transcriptional control regions, which exhibit tissue specificity and have been 
utilized in transgenic animals: elastase I gene control region which is active in pancreatic 
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acinar cells (Sw.ft et al.. 1984. Cell 38:639-646; Ornitz et al.. 1986. Cold Spnng Harbor 
Symp Quant. Biol. 50:399-409; MacDonald. 1987. Hepatology 7:425-515); .nsulin gene 
control region which is active in pancreatic beta cells (Hanahan. 1985. Nature 315:115- 
P^) in^unoglobuHn gene control reg.on which is act.ve m lympho.d cells (Grosschedl et 
5 al 1984. Cell 38:647-658; Adames et al.. 1985, Nature 3 18:533-538; Alexander et al.. 
1987 Mol Cell. Biol. 7:1436-1444), mouse mammary tumor virus control region which 
,s active in testicular, breast, lymphoid and mast cells (Leder et al., 1986. Cell 45:485- 
495), albumin gene control region which is active in Hver (Pinkert et al.. 1987. Genes and 
Devel 1-268-276), alpha-fetoprotein gene control region which is active m hver 
10 (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et al., 1987, Science 

235:53-58), alpha 1-antitrypsin gene control region which is active m the liver (Kelsey et 
al 1987 Genes and Devel. 1: 161-171), beta-globin gene comrol region which is active 
in myeloid cells (Mogram et al., 1985. Nature 315:338-340; Kollias et al.. 1986. Cell 
46-89-94) myelin basic protein gene control region which is active in oligodendrocyte 
,5 cells in the bram (Readhead et al., 1987, Cell 48:703-712), myosin light chain-2 gene 
control region which is active in skeletal muscle (Sani, 1985, Nature 314:283-286). and 
gonadotropic releasing hormone gene control region which is active in the hypothalamus 
(Mason et al., 1986. Science 234:1372-1378). 

20 Expression vectors containing a nucleic acid encoding a Stat protein of the invention can 
be identified by four general approaches: (a) PCR amplification of the desired plasmid 
DNA or specific mRNA, (b) nucleic acid hybridization, (c) presence or absence of 
selection marker gene functions, and (d) expression of inserted sequences. In the first 
approach, the nucleic acids can be amplified by PCR to provide for detection ot the 
25 amplified product. In the second approach, the presence of a foreign gene inserted in an 
expression vector can be detected by nucleic acid hybridization using probes comprising 
sequences that are homologous to an inserted marker gene. In the third approach, the 
recombinant vector./host system can be identified and selected based upon the presence or 
absence of certain "selection marker" gene functions (e.g., /3-galactosidase activity, 
30 thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion 
body formation in baculovirus, etc.) caused by the insertion of foreign genes in the vector. 
In another example, if the nucleic acid encoding Stat protein is inserted within the 
■■selection marker" gene sequence of the vector, recombinants containing the Stat protein 
insert can be identified by the absence of the Stat protein gene function. In the fourth 
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approach, recombinant expression vectors can be identified by assaying for the activity, 
biochemical, or immunological characteristics of the gene product expressed by the 
recombinant, provided that the expressed protein assumes a functionally active 
conformation. 

5 

A wide variety of host/expression vector combinations may be employed in expressing the 
DNA sequences of this invention. Useful expression vectors, for example, may consist of 
segments of chromosomal, nonchromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli plasmids 

10 col El. pCRl, PBR322, pMal-C2, pET, pGEX (Smith et al., 1988, Gene 67:31-40), 
pMB9 and their derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous 
derivatives of phage X, e.g., NM989. and other phage DNA, e.g.. M13 and filamentous 
single stranded phage DNA; yeast plasmids such as the 2, plasmid or derivatives thereof; 
vectors useful in eukaryotic cells, such as vectors useful in insect or mammalian cells; 

15 vectors derived from combinations of plasmids and phage DNAs, such as plasmids that 
have been modified to employ phage DNA or other expression control sequences; and the 
like. 

For example, in a baculovirus expression systems, both non-fosion transfer vectors, such 
20 as but not limited to pVL941 iBamHl cloning site; Summers), pVL1393 (5amHl, SmaL 
Xbal EcoRh Noth XmaWl, BglW, and Pstl clomng site; Invitrogen). pVL1392 (Bgllh 
Psth Noth Xmallh EcoRl, Xbal. Smal, and BamUl cloning site; Summers and 
Inviirogen). and pBlueBadll iBamHl, Bglll, Pstl, Ncol, and Mil cloning site, with 
blue/white recombinant screening possible; Invitrogen), and fusion transfer vectors, such 
25 as but not limited to pAc700 (BamHI and Kpnl cloning site, in which the BamHl 

recognition site begins with the initiation codon; Summers), pAc701 and pAc702 (same as 
pAc700, with different reading frames), pAc360 KBamYll cloning sit,e 36 base pairs 
downstream of a polyhedron initiation codon; Invitrogen(195)), and pBlueBacHisA, B, C 
(three different reading frames, with Bamm, BglM, Pstl, Ncol, and mndlll cloning site. 
30 an N-terminal peptide for ProBond purification, and blue/white recombinant screening of 
plaques; Invitrogen (220)) can be used. 

Mammalian expression vectors contemplated for use in the invention include vectors with 
inducible promoters, such as the dihydrofolate reductase (DHFR) promoter, e.g., any 
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expression vector with a DHFR expression vector, or a D//F-R/methotrexate co- 
ampHfication vector, such as pED iPstl. Sail Sbal, Smal, and EcoRI cloning site, wuh 
the vector expressing both the cloned gene and DHFR: see Kaufman. Current Protocols m 
Molecular Biology. 16. 12 ( 1991). Alternatively, a glutamine synthetase/methionine 
sulfoximine co-amplification vector, such as pEEl4 (Mil, Xbal. Smal Sbal, EcoRl, 
and Bell cloning site, in which the vector expresses glutamine synthase and the cloned 
gene; Celltech). In another embodiment, a vector that directs episomal expression under 
Ltrol of Epstein Barr V.rus (EBV) can be used, such as pREP4 iBamHl, Sfil XhoX, 
NotU Nheh Hindlll Nhel Pvull and Kpnl cloning site, constitutive RSV-LTR promoter, 
hygromycin selectable marker; Invitrogen), pCEP4 (BamHl, Sfil Xhol Notl Nhel 
Hindlll Nhel Pvull and Kpnl cloning site, constitutive hCMV immediate early gene, 
hygromycin selectable marker; Invitrogen). pMEP4 ^Kpnl Pvul Nhel Hindlll Notl 
Xhol Sfil Bamm cloning site, inducible methallothionein Ila gene promoter, hygromycin 
selectable marker: Invitrogen), pREPS (BamUl Xhol Notl Hindlll Nhel and Kpnl 
cloning site, RSV-LTR promoter, histidinol selectable marker; Invitrogen), pREP9 (Kpnl 
Nhel ^Hindlll Notl Xhol Sfil and BamHI cloning sue, RSV-LTR promoter, G418 
selectable marker; Invitrogen), and pEBVHis (RSV-LTR promoter, hygromycin selectable 
marker, N-terminal peptide purif.able via ProBond resin and cleaved by enterokmase; 
Invitrogen). Selectable mammalian expression vectors for use in the invention include 
pRc/CMV (Hindlll BstXl Notl Sbal and Apal cloning site, G418 selection; Invitrogen), 
pRc/RSV (Htndlll Spel BstXl Notl Xbal cloning site, G418 selection; Invitrogen), and 
others. Vaccinia virus mammalian expression vectors (see, Kaufman, 1991, supra) for 
use according to the invention include but are not limited to pSCU (Smal cloning site, 
TK- and ^-gal selection), pMJ601 (SaA, Smal Afll Narl BspMll BamUl Apal Nhel 
Sacll Kpnl and Hindlll cloning site; TK- and ^-gal selection), and pTKgptFlS (EcoRI, 
Pstl Sail Accl Hindll Sbal BamHI and Hpa cloning site, TK or XPRT selection). 



Yeast expression systems can also be used according to the invention to express OB 
polypeptide. For example, the non-ft.sion pYES2 vector (Xbal Sphl Shol Notl GstXl 
30 EcoRl BstXl BamHU Sad Kpnl and Hindlll cloning sit; Invitrogen) or the fusion 

pYESHisA, B. C (Xbal Sphl Shol Notl BstXl EcoRl BamUl, Sad Kpnl and Hindlll 
cloning site. N-terminal peptide purified with ProBond resin and cleaved with 
enterokinase; Invitrogen), to mention just two, can be employed according to the present 
invention. 
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Once a particular recombinant DNA molecule is identified and isolated, several methods 
known in the art may be used to propagate it. Once a suitable host system and growth 
conditions are established, recombinant e.xpression vectors can be propagated and prepared 
in quantity. As previously explained, the expression vectors which can be used include, 
but are not limited to. the following vectors or their derivatives: human or animal viruses 
such as vaccinia virus or adenovirus; insect viruses such as baculovirus; yeast vectors; 
bacteriophage vectors (..,., lambda), and plasmid and cosmid DNA vectors, to name but 



a few. 
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Vectors are introduced into the desired host cells by methods known in the art, e.g., 
transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, 
calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun, or a 
DNA vector transporter (see, e.g., Wu et al.. 1992, J. Biol. Chem. 267:963-967; Wu and 
Wu, 1988, J. Biol. Chem. 263:14621-14624; Hartmut et al.. Canadian Patent Application 
15 No. 2,012,311, filed March 15, 1990). 



General Protein Purification Procedures: 



Initial steps for purifying the proteins of the present invention include salting in or salting 
.0 out such as in ammonium sulfate fractionations; solvent exclusion fractionations, an 
ethanol precipitation; detergent extractions to free membrane bound proteins using such 
detergents as Triton X-100, Tween-20 etc.; or high salt extractions. Solubilization of 
proteins may also be achieved using aprotic solvents such as dimethyl sulfoxide and 
hexamethylphosphoramide. In addition, high speed ultracentrifugation may be used either 
25 alone or in conjunction with other extraction techniques. 

Generally good secondary isolation or purification steps include solid phase absorption 
using calcium phosphate gel or hydroxyapatite; or solid phase binding. Solid phase 
binding may be performed through ionic bonding, with either an anion exchanger, such as 
30 diethylaminoethyl (DEAE), or diethyl [2-hydroxypropyl] aminoethyl (QAE) SEPHADEX 
or cellulose; or with a cation exchanger such as carboxymethyl (CM) or sulfopropyl (SP) 
SEPHADEX or cellulose. Alternative means of solid phase binding includes the 
exploitation of hydrophobic interactions e.g. . the using of a solid support such as 
phenylSEPH AROSE and a high salt buffer; aff.nity-binding, using, placing a specific 
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DNA binding site of a Stat protein to an activated support: .mmuno-b.ndmg. usmg ..g., 
an antibody to the Stat protein bound to an activated support; as we,, as other solid phase 
supports including those that contain specific dyes or lectins etc. A further solid phase 
support technique that is often used at the end of the purification procedure relies on size 
5 exclusion, such as SEPHADEX and SEPHAROSE gels, or pressurized or centrifugal 
membrane techniques, using size exclusion membrane filters. 

solid phase support separations are generally performed batch-wise with low-speed 
centrifugations or by column chromatography. High performance liquid chromatography 
10 (HPLC) including such related techniques as FPLC, is presently the most common means 
of performing liquid chromatography. Size exclusion techniques may also be 
accomplished with the aid of low speed centrifugation. 

,n addition size permeation techniques such as gel electrophoretic techniques may be 
15 employed. These techniques are generally performed in tubes, slabs or by capillary 

electrophoresis. 

Almost all steps involving protein purification employ a buffered solution. Unless 
otherwise specified, generally 25-100 mM concentrations are used. Low concentration 
20 buffers generally infer 5-25 mM concentrations. High concentration buffers generally 
infer concentrations of the buffering agent of between 0. 1-2M concentrations. Typical 
buffers can be purchased from most biochemical catalogues and include the classical 
buffers such as Tris, pyrophosphate, monophosphate and diphosphate. The Good buffers 
[Good N.E., et al..(1966) Biochemistry, 5, 467; Good, N.E. and Izawa, S., (197_) 
25 Meth. Enzymol., 24, Part B, 53; and Fergunson, W.J. and Good, N. E., (1980) Anal. 
B.ochem. 104. 300.] such as Mes. Hepes. Mops, tricine and Ches. 

1 

Materials to perform all of these techniques are available from a variety of sources such as 
Sigma Chemical Company in St. Louis, Missouri. 

30 

S ynthetic P ^iyp'-ptides and Frapments Thereof 

The term "polypeptide" is used in its broadest sense to refer to a compound of two or 
more subunit amino acids, amino acid analogs, or pept.domimetics. The subunits may be 
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linked by peptide bonds. In another embodiment, the subunit may be linked by other the 
bonds, e.s-. ester, ether, etc. As used herein the term "amino acid" refers to either 
natural and/or unnatural or synthetic amino acids, including glycine and both the D or L 
optical isomers, and amino acid analogs and peptidomimetics. A peptide of three or more 
5 amino acids is commonly called an oligopeptide if the peptide chain is short. If the 
peptide chain is long, the peptide is commonly called a polypeptide or a protein. 

The Stat proteins and active fragments thereof, including the truncated Stat protein of the 
present invention may be chemically synthesized. In addition, potential drugs that may be 
10 tested in the drug screening assays of the present invention may also be chemically 

synthesized. Synthetic polypeptides, prepared using the well known techniques of solid 
phase, liquid phase, or peptide condensation techniques, or any combination thereof, can 
include namral and unnatural amino acids. Amino acids used for peptide synthesis may be 
standard Boc (N«-amino protected N«-t-butyloxycarbonyl) amino acid resin with the 
15 standard deprotecting, neutralization, coupling and wash protocols of the original solid 
phase procedure of Merrifield (1963, J. Am. Chem. Soc. 85:2149-2154). or the base- 
labile N"-amino protected 9-fluorenylmethoxycarbonyl (Fmoc) amino acids first described 
by Carpino and Han (1972, J. Org. Chem. 37:3403-3409). Both Fmoc and Boc N"-amino 
protected amino acids can be obtained from Fluka, Bachem, Advanced Chemtech, Sigma. 
20 Cambridge Research Biochemical, Bachem, or Peninsula Labs or other chemical 

companies familiar to those who practice this art. In addition, the method of the invention 
can be used with other N"-protecting groups that are familiar to those skilled in this art. 
Solid phase peptide synthesis may be accomplished by techniques familiar to those in the 
art and provided, for example, in Stewart and Young, 1984, Solid Phase Synthesis, 
25 Second Edition, Pierce Chemical Co., Rockford, IL; Fields and Noble, 1990, Int. J. Pept. 
Protein Res. 35:161-214, or using automated synthesizers, such as sold by ABS. Thus, 
polypeptides of the invention may comprise D-ammo acids, a combination oi D- and L- 
amino acids, and various "designer" amino acids (e.g., ^-methyl amino acids, Ca-methyl 
amino acids, and Na-methyl amino acids, etc.) to convey special properties. Synthetic 
30 amino acids include ornithine for lysine, fluorophenylalanine for phenylalanine, and 

norleucine for leucine or isoleucine. Additionally, by assigning specific amino acids at 
specific coupling steps, a-helices, /3 turns, /3 sheets. 7-turns. and cyclic peptides can be 
generated. 
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in a ftirther embodhnent, subunits of peptides that confer useful chemical and structural 
properties will be chosen. For example, peptides comprising D-amino acids will be 
resistant to L-ammo acid-specif.c proteases m v.v.. In addition, the present invention 
envisions preparing peptides that have more well defined structural properties, and the use 

5 of peptidomimetics. and peptidomimetic bonds, such as ester bonds, to prepare peptides 
with novel properties. In another embodiment, a peptide may be generated that 
incorporates a reduced peptide bond, i.e., R,-CH,-NH-R„ where R, and R, are amino 
acid residues or sequences. A reduced peptide bond may be introduced as a dipeptide 
subunit. Such a molecule would be resistant to peptide bond hydrolysis, e.g., protease 

10 activity. Such peptides would provide l.gands with unique function and activity, such as 
extended half-lives in vivo due to resistance to metabolic breakdown, or protease activity. 
Furthermore it is well known that in certain systems constrained peptides show enhanced 
functional activity (Hruby, 1982, Life Sciences 31-.189-199; Hruby et al., 1990. Biochem 
J. 268:249-262); the present invention provides a method to produce a constrained peptide 

15 that incorporates random sequences at all other positions. 

Constrained and cyclic peptides. A constrained, cyclic or rigidized peptide may be 
prepared synthetically, provided that in at least two positions in the sequence of the 
peptide an amino acd or amino acid analog is inserted that provides a chemical functional 
20 group capable of crosslinking to constrain, cyclise or rigidize the peptide after treatment to 
form the crosslink. Cyclization will be favored when a mrn-inducing amino acid is 
incorporated. Examples of amino acids capable of crosslinking a peptide are cysteine to 
form disulfides, aspartic acid to form a lactone or a lactam, and a chelator such as 
^-carboxyl-glutamic acid (Gla) (Bachem) to chelate a transition metal and form a cross- 
25 link. Protected 7-carboxyl glutamic acid may be prepared by modifying the synthesis 
described by Zee-Cheng and Olson (1980, Biophys. Biochem. Res. Commun. 94:1128- 
1132). A peptide in which the peptide sequence comprises at least two ammo acids 
capable of crosslinking may be treated, e.g., by oxidation of cysteine residues to form a 
disulfide or addition of a metal ion to form a chelate, so as to crosslink the peptide and 
30 form a constrained, cyclic or rigidized peptide. 

The present invention provides strategies to systematically prepare cross-links. For 
example, if four cysteine residues are incorporated in the peptide sequence, different 
protecting groups may be used (Hiskey, 1981, in The Peptides: Analysis, Synthesis. 
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Biology, Vol. 3, Gross and Meienhofer, eds.. Academic Press: New York. pp. 137-167; 
Ponsanti et al.. 1990. Tetrahedron 46:8255-8266). The first pair of cysteines may be 
deprotected and o.>cidized, then the second set may be deprotected and oxidized. In this 
way a defined set of disulfide cross-links may be formed. Alternatively, a pair of 
5 cysteines and a pair of chelating amino acid analogs may be incorporated so that the cross- 
links are of a different chemical nature. 

Non-classical amino acids that induce conformational constraints. The following non- 
■ classical amino acids may be incorporated in the peptide in order to introduce particular 

10 conformational motifs: i.2,3,4-tetrahydroisoquinoline-3-carboxylate (Kazmierski et al., 
1991, J. Am. Chem. Soc. 113:2275-2283); (2S,3S)-methyl-phenylalanine, (2S,3R)-methyl- 
phenylalanine, (2R,3S)-methyl-phenylalanine and (2R.3R)-methyl-phenylalanine 
(Kazmierski and Hruby, 1991, Tetrahedron Lett.); 2-aminotetrahydronaphthalene-2- 
carboxylic acid (Landis, 1989. Ph.D. Thesis, University of Arizona); hydro.xy-1.2,3,4- 

15 tetrahydroisoquinoline-3-carboxylate (Miyake et al., 1989, J. Takeda Res. Labs. 43:53- 
76)- ^3-carboUne (D and L) (Kazmierski. 1988, Ph.D. Thesis, University of Arizona); HIC 
(histidine isoquinolme carboxylic acid) (Zechel et al., 1991, Int. J. Pep. Protein Res. 43); 
and HIC (histidine cyclic urea) (Dharanipragada). 

20 The following amino acid analogs and peptidom.metics may be incorporated into a peptide 
to induce or favor specific secondary structures: LL-Acp (LL-3-amino-2-propenidone-6- 
carboxylic acid), a ^-turn inducing dipeptide analog (Kemp et al., 1985. J. Org. Chem. 
50:5834-5838); ^-sheet inducing analogs (Kemp et al., 1988, Tetrahedron Lett. 29:5081- 
5082)- ^-turn inducing analogs (Kemp et al., 1988, Tetrahedron Lett. 29:5057-5060); 
25 « -helix inducing analogs (Kemp et al. . 1988, Tetrahedron Lett. 29:4935-4938); T-turn 
inducing analogs (Kemp et al., 1989, J. Org. Chem. 54:109:115); and analogs provided 
by the following references: Nagai and Sato, 1985, Tetrahedron LeU. 26:647-650; 
DiMaio et al., 1989, J. Chem. Soc. Perkin Trans, p. 1687; also a Gly-Ala turn analog 
(Kahn et al.. 1989, Tetrahedron Lett. 30:2317); amide bond isostere (Jones et al., 1988, 
30 Tetrahedron Lett. 29:3853-3856); tretrazol (Zabrocki et al., 1988, J. Am. Chem. Soc. 
110-5875-5880); DTC (Samanen et al., 1990, Int. J. Protein Pep. Res. 35:501:509); and 
analogs taught in Olson et al., 1990, J. Am. Chem. Sci. 112:323-333 and Garvey et al.. 
1990 J. Org. Chem. 56:436. Conformationally restricted mimetics of beta turns and beta 
bulges, and peptides contaimng them, are described m U.S. Patent No. 5.440,013, issued 
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August 8, 1995 to Kahn. 

Derivatized and modified peptides. The present invention further provides for 
modification or derivatization of a peptide of the invention. Modifications of peptides are 
5 well known to one of ordinary skill, and include phosphorylation, carboxymethylation. 
and acylation. Modif-ications may be effected by chemical or enzymatic means. 

In another aspect, glycosylated or fatty acylated peptide derivatives may be prepared. 
Preparation of glycosylated or fatty acylated peptides is well known in the art as 
10 exemplified by the following references: 

1. Garg and Jeanloz, 1985, in Advances in Carbohydrate Chemistry and 
Biochemistry, Vol. 43, Academic Press. 

2. Kunz, 1987, in Ang. Chem. Int. Ed. English 26:294-308. 

3. Horvat et al., 1988, Int. J. Pept. Protein Res. 31:499-507. 
15 4. Bardaji et al., 1990, Ang. Chem. Int. Ed. English, 23:231. 

5. Toth et al., 1990, in Peptides: Chemistry, Structure and Biology, Rivier 
and Marshal, eds., ESCOM Publ., Leiden, pp. 1078-1079. 

6. Torres et al., 1989, Experientia 45:574-576. 

7. Torres et al., 1989, EMBO J. 8:2925-2932. 

20 8. Hordever and Musiol, 1990, in Peptides: Chemistry, Structure and 

Biology, loc^ ciL, PP- 811-812. 

9. Zee-Cheng and Olson, 1989, Biochem. Biophys. Res. Commun. 94:1128- 

1132. 

10. Marki et al., 1977, Helv. Chem. Acta., 60:807. 

25 11. Fuju et al. 1987, J. Chem. Soc. Chem. Commun., pp. 163-164. 

12. Ponsati et al., 1990, Peptides 1990, Giralt and Andrea, eds., ESCOM 
Publ.. pp. 238-240. 

13. Fuji et al.. 1987, 1988, Peptides: Chemistry and Biology, Marshall, ed., 
ESCOM Publ., Leiden, pp. 217-219. 

30 

There are two major classes of peptide-carbohydrate linkages. First, ether bonds join the 
serine or threonine hydroxyl to a hydroxyl of the sugar. Second, amide bonds join 
glutamate or aspartate carboxyl groups to an amino group on the sugar. In particular, 
references 1 and 2. supra, teach methods of preparing peptide-carbohydrate ethers and 
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amides. Acetal and ketal bonds may also bind carbohydrate to peptide. 

Fatty acyl peptide derivat.ves may also be prepared. For example, and not by way of 
limitation, a free amino group (N-termtnal or lysyl) may be acylated. e.g.. mynstoylated. 
in another embodiment an ammo acid comprising an aliphatic stde chain of the structure - 
(CH0„CH3 may be incorporated in the peptide. This and other peptide-fatty acid 
conjugates suitable for use in the present invention are disclosed in U.K. Patent GB- 
8809162.4, International Patent Application PCT/AU89/00166, and reference 5. supra. 



10 Phage libraries for Drug Screening. 

Phage libraries have been constructed which when infected into host E. coli produce 
random peptide sequences of approximately 10 to 15 amino acids [Parmley and Smith, 
Gene 73:305-318 (1988), Scott and Smith, Science 249:386-249 (1990)]. Specifically, the 
phage library can be mixed in low dilutions with permissive E. coU in low melting point 
15 LB agar which is then poured on top of LB agar plates. After incubating the plates at 
37°C for a period of time, small clear plaques in a lawn of £. coli will form which 
represents active phage growth and lysis of the E. coli. A representative of these phages 
can be absorbed to nylon filters by placing dry filters onto the agar plates. The filters can 
be marked for orientation, removed, and placed in washing solutions to block any 
20 remaining absorbent sites. The filters can then be placed in a solution containing, for 
example, a radioactive N-terminal peptide fragment of a Stat protein ie.g.. the fragment 
having the amino acid sequence of SEQ ID NO:4). After a specified inc^at ion period, 
the filters can be thoroughly washed and developed for autoradiography. ^9^^ 
containing the phage that bind to the radioactive N-terminal peptide fragment of a Stat 
25 protein can then be identified. These phages can be ftirther cloned and then retested for 
their abilitv to bind to the N-terminal peptide fragment of a Stat protein as before. Once 
the phages have been purified, the binding sequence contained within the phage can be 
determined by standard DNA sequencing techniques. Once the DNA sequence is known, 
synthetic peptides can be generated which represents these sequences. 

These peptides can be tested, for example, for their ability to: (1) interfere with a Stat 
protein binding to its DNA binding site; and (2) interfere with a truncated Stat protein 
binding to the DNA binding site. If the peptide interferes in the first case but does not 
mterfere in the latter case, it may be concluded that the peptide interferes with N-term.nal 
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inter-protein interaction of Stat proteins. 



10 



15 



The effecttve peptide(s) can be synthesized in large quantities for use tn viv. models 
and eventually .n humans to prevent modulate signal transduction. It should be 
emphasized that synthetic peptide production is relatively non-labor intensive, easily 
n^anufactured. quality controlled and thus, large quantities of the desired product can be 
produced quite cheaply. Similar combinations of mass produced synthetic peptides have 
recently been used with great success [Patarroyo, Vaccine 10:175-178 (1990)]. 

Anfihodies t o the Truncate d Stat Protein 
According to the present invention, the truncated Stat protein_as purified from recombinant 
sources or produced by chemical synthesis, and derivatives or analogs thereof, including 
fusion proteins, may be used as an immunogen to generate antibodies that recognize the 
truncated Stat protein. Such antibodies include but are not limited to polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and a Fab expression library. The 
anti-truncated Stat protein antibodies of the invention may be cross reactive, that is, they 
n.ay recognize the truncated Stat protein derived from different natural Stat proteins such 
as Human Static, Human Stat 6 or a Drosoph.la Stat protein. Polyclonal antibodies have 
greater likelihood of cross reactivity. Alternatively, an antibody of the invention may be 
specific for a single form of the truncated Stat, such as the Human Statltc having an 
amino acid sequence of SEQ ID N0:3. 

various procedures known in the art may be used for the production of polyclonal 
antibodies to the truncated Stat protein or derivative or analog thereof. For the production 
; of antibody, various host animals can be immunized by injection with the truncated Stat 
protein, or a derivative (e.g., or fusion protein) thereof, including but not limited to 
rabbits, mice. rats, sheep, goats, etc. In one embodiment, the truncated Stat protein can 
be conjugated to an imn^unogenic carrier, e.g., bovine serum albumin (BSA) or keyhole 
lin^pet hemocyanin (KLH). Various adjuvants may be used to increase the immunological 
0 response, depending on the host species, including but not limited to Freund's (complete 
and incomplete), mineral gels such as aluminum hydroxide, surface active substances such 
as lysolecithin, pluromc polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanins. dinitrophenol, and potentially useful human adjuvants such as BCG (baalle 
Calmette-Guerin) and Corynebacterium parvum. 
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For preparation of monoclonal antibodies directed toward the truncated Stat protein, or 
analog, or derivative thereof, any technique that provides for the production of antibody 
molecules by continuous cell lines in culture may be used. These include but are not 
limited to the hybndoma technique originally developed by Kohler and Milstein [Nature 
5 256:495-497 (1975)], as well as the trioma technique, the human B-cell hybndoma 
technique [Kozbor et al.. Immunology Today 4:72 1983); Cote et al., Proc. Natl. Acad. 
Sci. U.S.A. 80:2026-2030 (1983)], and the EBV-hybridoma technique to produce human 
monoclonal antibodies [Cole et al., in Monoclonal Antibodies and Cancer Therapy, Alan 
R. Liss, Inc., pp. 77-96 (1985)]. In an additional embodiment of the invention, 
10 monoclonal antibodies can be produced in germ-free animals utilizing recent technology 
[PCT/US90/02545]. In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" [Morrison et al., /. Bacteriol. 159:870 (1984); 
Neuberger et al.. Nature 312:604-608 (1984); Takeda et al.. Nature 314:452-454 (1985)] 
by splicing the genes from a mouse antibody molecule specific for an truncated Stat 
15 protein together with genes from a human antibody molecule of appropriate biological 
activity can be used; such antibodies are within the scope of this invention. Such human 
or humanized chimeric antibodies are preferred for use in therapy of human diseases or 
disorders (described infra), since the human or humanized antibodies are much less likely 
than xenogenic antibodies to induce an immune response, in particular an allergic 
20 response, themselves. 

According to the invention, techniques described for the production of single chain 
antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. Patent 4,946,778] 
can be adapted to produce truncated Stat protein-specific single chain antibodies. An 
25 additional embodiment of the invention utilizes the techniques described for the 

construction of Fab expression libraries [Huse et al.. Science 246:1275-1281 (1989)] to 
allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity for a truncated Stat protein, or its derivatives, or analogs. 

30 Antibody fragments which contain the idiotype of the antibody molecule can be generated 
by known techniques. For example, such fragments include but are not limited to: the 
F(ab')2 fragment which can be produced by pepsin digestion of the antibody molecule; the 
Fab' fragments which can be generated by reducing the disulfide bridges of the F(ab'), 
fragment, and the Fab fragments which can be generated by treating the antibody molecule 
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with papain and a reducing agent. 

,n the producfon of ant.bodies. screening for the desired ant.body can be accomplished by 
techniques known in the art, e.g., rad.oimmunoassay, ELISA (enzyme-lmked 
5 immunosorbant assay), "sandwich" immunoassays, immunoradiometnc assays, gel 
diffusion precipitin reactions, immunodiffusion assays, in sua immunoassays (usmg 
colloidal gold, enzyme or radioisotope labels, for example), western blots, prec.p.tation 
reactions, agglutination assays {e.g.. gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, protein A assays, and 
10 immunoelectrophoresis assays, etc. In one embodiment, antibody binding is detected by 
detecting a label on the primary antibody. In another embodiment, the primary ant.body 
is detected by detecting binding of a secondary antibody or reagent to the primary 
antibody. In a further embodiment, the secondary antibody is labeled. Many means are 
known in the art for detecting binding in an immunoassay and are within the scope of the 
15 present invention. For example, to select antibodies which recognize a specific epitope ot 
the truncated Stat protein, one may assay generated hybridomas for a product which binds 
to the truncated Stat protein fragment containing such epitope. For selection of an 
antibody specific to the truncated Stat protein from a particular source, one can select on 
the basis of positive binding with truncated Stat protein expressed by or isolated from that 
20 specific source. 

The foregoing antibodies can be used in methods known in the art relating to the 
localization and activity of the truncated Stat protein, for Western blotting, imaging 
truncated Stat protein in situ, measuring levels thereof in appropriate physiological 
25 samples, etc. using any of the detection techniques mentioned above or known in the art. 

In a specific embodiment, antibodies that agomze or antagonize the activity of truncated 
Stat protein can be generated. Such antibodies can be tested using the assays described 
infra for identifying ligands. 
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Labels : 

Suitable labels include enzymes, fluorophores (e.g.. fluorescene isothiocvanate (FITC), 
phycoerythrin (PE), Texas red (TR), rhodamine, free or chelated lanthanide series salts, 
especially Eu^^ to name a few fluorophores), chromophores. radioisotopes, chelating 
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agents, dyes, collo.dal gold, latex particles. Hgands (e.g., b.otin). and chemiluminescent 
agents. When a control marker is employed, the same or different labels may be used for 
the receptor and control marker. 

In the instance where a radioactive label, such as the isotopes ^H. '^C, ^=P, '^S, ^"Cl, "Cr. 
"Co '*Co '■'Fe, *Y, "^1, "'I, and "*Re are used, known currently available counting 
procedures may be utilized. In the instance where the label is an enzyme, detection may 
be accomplished by any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques known in the art. 
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Direct labels are one example of labels which can be used according to the present 
invention. A direct label has been defined as an entity, which in its natural state, is 
readily visible, either to the naked eye, or with the aid of an optical filter and/or applied 
stimulation, e.g. U.V. light to promote fluorescence. Among examples of colored labels, 
15 which can be used according to the present invention, include metallic sol particles, for 
example gold sol particles such as those described by Leuvering (U.S. Patent 4,313,734); 
dye sole particles such as described by Gribnau et al. (U.S. Patent 4,373,932) and May et 
al. (WO 88/08534): dyed latex such as described by May, supra, Snyder (EP-A 0 280 559 
and 0 281 327); or dyes encapsulated in liposomes as described by Campbell et al. (U.S. 
20 Patent 4,703,017). Other direct labels include a radionucleotide, a fluorescent moiety or a 
luminescent moiety. In addition to these direct labelling devices, indirect labels 
comprising enzymes can also be used according to the present invention. Various types of 
enzyme linked immunoassays are well known in the art, for example, alkaline phosphatase 
and horseradish peroxidase, lysozyme, glucose-6-phosphate dehydrogenase, lactate 
25 dehydrogenase, urease, these and others have been discussed in detail by Eva Engvall in 
Enzyme Immunoassay ELISA and EMIT m Methods in Enzymology, 70. 419-439. 1980 
and in U.S. Patent 4,857,453. 

Suitable enzymes include, but are not limited to, alkaline phosphatase and horseradish 

30 peroxidase. 

Other labels for use in the invention include magnetic beads or magnetic resonance 
imaging labels. 
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In another embodiment, a phosphorylation site can be created on an antibody of the 
invention for labelmg with "P, e.g., as described in European Patent No. 0372707 
(application No. 89311108.8) by Sidney Pestka. or U.S. Patent No. 5.459.240, issued 
October 17. 1995 lo Foxwell et al. 

As exemplified herein, proteins, including antibodies, can be labeled by metabolic 
labeling. Metabolic labeling occurs during in vitro incubation of the cells that express the 
protein in the presence of culture medium supplemented with a metabolic label, such as 
["S]-methionine or ['-P]-orthophosphate. In addition to metabolic (or biosynthetic) 
labeling with ["S]-methionine, the invention further contemplates labeling with ['"C]- 
amino acids and [^H]-amino acids (with the tritium substituted_ at non-labile positions). 



Rindinf ^ Assavs for Drug Scre ening Assays 
The drug screening assays of the present invention may use any of a number of assays for 
15 measuring the stability of a protein-protein interaction, including fragments thereof, or a 
protein-DNA binding interaction. In one embodiment the stability of preformed DNA 
protein complex between a Stat protein and its corresponding DNA binding site is 
examined as follows: the formation of a complex between the Stat protein and a labelled 
oligonucleotides is allowed to occur and unlabeled oligonucleotides are added in vast 
20 molar excess after the reaction reaches equilibrium. At various times after the addition of 
unlabelled competitor DNA. aliquots are layered on a running native polyacrylamide gel 
to determine free and bound oligonucleotides. In one preferred embodiment the protein is 
Statla, and two different labelled DNAs are used, the natural cfos site, an example of a 
"weak" site, and the mutated cfos-promotor element (M67) an example of a "strong" site 
25 as described below. Other examples of weak sites include those in the promoter of the 

MIG gene, and those in the regulatory region of the interferon-7 gene. Other examples of 
strong sites include those from the promoter of the Ly6E gene or the promoter of the 
IRF-1 gene. 

30 In other binding assays, an N-terminal fragment of the Stat protein is placed or coated 
onto a solid support. Methods for placing the N-terminal fragment on the solid support 
are well known in the art and include such things as linking biotin to the fragment and 
linking avidin to the solid support. The corresponding free N-terminal fragment is 
allowed to equilibrate with the bound fragments and drugs are tested to see if they disrupt 
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or enhance the dimer binding. Disruption leads to either a faster release of the free N- 
terminal fragment which may be expressed as a faster off time, and or a greater 
concentrafon of released fragment. Enhancement leads to either a slower release of the 
free N-terminal fragment which may be expressed as a slower off time, and or a lower 
5 concentration of released fragment. 

The N-terminal fragment may be labeled as described above. For example, in one 
embodiment radiolabled N-terminal fragments are used to measure the effect of a drug on 
binding. In another embodiment the natural ultraviolet absorbance of the free N-terminal 
10 fragments is used. In yet another embodiment, a Biocore chip (Pharmacia) coated with 
the N-terminal fragment of a Stat protein is used and the change in surface conductivity 
can be measured. 

Drug screening assays may also be performed in cells which are induced to contain 
15 activated STAT proteins, which are dimenc STAT proteins. Although cells that naturally 
encode the STAT proteins may be used, preferably a cell is used that is transfected with a 
plasmid encoding the STAT protein. For example transient transfections can be 
performed with 50% confluent U3A cells using the calcium phosphate method as 
instructed by the manufacturer (Stratagene). In addition the cells can also be modified to 
20 contain one or more reporter genes, a heterologous gene encoding a reporter such as 
luciferase, green fluorescent protein or derivative thereof, chloramphenicol acetyl 
transferase. B-galactosidase. etc. Such reporter genes can individually be operably linked 
to promoters comprising two weak STAT binding sites and/or a promoter comprising a 
strong STAT binding site. Assays for detecting the reporter gene products are readily 
25 available in the literature for example, luciferase assays can be performed according to the 
manufacturer's protocol (Promega). and i3-galactosidase assays can be performed as 
described by Ausubel ei aL, [in Current Protocols in Molecular Biology, J. Wiley & 
Sons.Inc. (1994)]. 

30 In one example, the transfection reaction can comprise the transfection of a cell with a 
plasmid modified to contain a STAT protein, such as a pcDNA3 plasmid (Invitrogen), a 
reporter plasmid that contains a first reporter gene, and a reporter plasmid that contains a 
second reporter gene. Although the preparation of such plasmids is now routine in the 
art, many appropriate plasmids are commercially available e.g., a plasmid with 0- 
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galactosidase is available from Stratagene. 

The reporter piasmids can contain specific restriction sites in which an enhancer element 
having a strong STAT binding site or alternatively two tandemly arranged "weak" STAT 
binding sites are inserted. In one particular embodiment, thirty-six hours after transfection 
of the cells with a plasmid encoding STAT-1, the cells are treated with 5 ng/ml 
interferon-7 Amgen for ten hours. Protein expression and tyrosine phosphorylation (to 
monitor STAT activation) can be determined by e.g.. gel shift experiments with whole cell 
extracts. 
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Cells containing a STAT protein and a reporter gene that is operably linked to a promoter 
comprising two weak STAT binding sites can be contacted with a prospective drug in the 
presence of a cytokine which activates the STAT(s) of interest. The amount of reporter 
produced in the absence and presence of prospective drug is determined and compared. 

15 Prospective drugs which reduce the amount of reporter produced are candidate antagonists 
of the N-terminal interaction, whereas prospective drugs which increase the amount of 
reporter produced are candidate agonists. Cells containing a reporter gene operably linked 
to a promoter comprising a strong STAT binding site are then contacted with these 
candidate drugs, in the presence of a cytokine which activates the STAT(s) of interest. 

20 The amount (and/or activity) of reporter produced in the presence and absence of 

candidate drugs is determined and compared. Drugs which disrupt interactions between 
the N-terminal domains of the STATs will not reduce reporter activity in this second step. 
Similarly, candidate drugs which enhance interactions between N-terminal domains of 
STATs will not increase reporter activity in this second step. 
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The present invention may be better understood by reference to the following non-limiting 
Example, which is provided as exemplary of the invention. The following example is 
presented in order to more fully illustrate the preferred embodiments of the invention. It 
should in no way be construed, however, as limiting the broad scope of the invention. 



F X AMPLE 

HNA Rinding of ^'ifrn activated punfied .Static. Sm\(i and truncated Statl: Interaction 
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henween NH , terminal Hnmains stabilizes binding of two dimers t o tandem DNA sites: 

Introduction 

To conveniently study the biochemistry of activated Stat molecules, it is necessary not 
5 only to use recombinant DNA techniques to produce large amounts of protein, but it is 
also necessary to phosphorylate the correct tyrosine residue and to separate the 
phosphorylated and nonphosphorylated proteins. The present invention teaches proteins, 
nucleic acids, and methods that satisfy these heretofore, unattained criteria. 

10 Human Statla and Statl/S, a shorter protein translated from an alternatively spliced 
mRNA, were produced in insect cells infected with recombinant baculovirus, thereby 
allowing milligram amounts of these proteins to be isolated at a time. The protease 
sensitivity of purified Statla was subsequently studied. A stable truncated form of Statl 
(Statltc) was then characterized and produced in bacteria. Stat la, Statl/3 and Statltc were 

15 quantitatively phosphorylated in vitro with immunoprecipitated, activated EGF-receptor 
kinase. The phosphoproteins were isolated in milligram quantities by a new 
chromatographic protocol, and the phosphorylation was shown to be on the correct 
tyrosine residue by mass spectroscopy of Statl fragments. Both the full length and the 
truncated phosphorylated protein dimerize and bind to DNA. 

20 

With the purified activated DNA binding form of Statl available, its DNA binding 
characteristics were studied. A KD of about 1 x lO"^ M for a variety of recognition . 
sequences was determined. By examining the stability of labelled preformed protein/DNA 
complexes when challenged with unlabelled DNA, we found a very short half-life of the 

25 protein/DNA complexes. For sites that showed the maximum binding stability, we 

determined a half-life, t,,,, of about 3 min. A more rapid exchange (half-life of < 30 sec) 
was observed for both Stat la or Statltc bound to the sites that are natural weak' binding 
sites in genomic DNA. Statl dimers (Guyer et al. , 1995) or dimers of Drosophila Stat 
protein (D-Stat) (Yan et al, 1996) may interact when two nearby Stat binding sites are 

30 both occupied. The purified activated human protein behaves in a similar manner based 
on evidence of interaction between bound dimeric molecules in which the binding of Stat 
dimers to adjacent DNA binding sites was stabilized when both sites were occupied. 
Furthermore this proposed Stat dimer interaction is dependent on the presence of the 
amino terminal 131 amino acids of Statl. 
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Materials and Methods 
Expression and punfication of Stat la and StatW. Nucleic acids containing sequences 
coding for human Statla and Statl/3 were amplified by PCR (pnmers containing 
respective restriction sites in addition to homologous sequence; Vent-polymerase; New 
England Biolabs) and the products cloned into the Stul/Bglll (Static) or EcoRI/Kpnl 
(Statl^3) - sites of the baculovirus transfer vector pAcSG2 (Pharmingen). Recombinant 
vectors were subsequently co-transfected with Baculogold baculovirus DNA (Pharmmgen) 
into Sf9 .nsect cells as described (Gruenwald and Heitz, 1993). Recombinant viruses were 
identified by immunoblot of extracts of infected cells. For protein production Sf9 cells m 
suspension culture (0.8 x 10^ cells/ml) were infected with recombinant viruses (mean of 
infection: 1.5) and harvested by centrifugation (1500 x g, 15 mm) 50 h post infectton. 



The cells (5-8 x 10^) were lysed in 80 ml ice cold extraction buffer [20 mM Mes, 100 

KCl 10 mM NaF, 10 mM Na.HPOVNaH.PO., pH 7.0. 10 mM NaPPi. 0.02% NaN3. 
15 4 n.M EDTA, 1 mM EGTA. 20 mM DTT, Complete™ protease inhibitors (Boehrmger 
Mamiheim), pH adjusted to 7.0 with 1 M Tris] with a dounce homogenizer (2 x 10 
strokes). All subsequent steps were performed at 4"C unless noted otherwise, Lysates 
were cleared bv centrifugation at 20.000 x g for 30 min. The supernatant was brought to 
pH 6 2 with 1 M Mes and after the addition of 0.5 vol buffer 1 (20 n^ Mes, 0.02% 
20 NaN3 20 mM DTT, pH adjusted to 6.0 with 1 M Tris) tt was agam centrifuged for 20 
min at 25 000 x g. The resulting supernatant was loaded onto a S-Sepharose (Pharmacia) 
column (5 x 5.5 cm) and eluted with a linear salt gradient (50-300 mM KCl) and pH 
gradient (pH 6-7). Stat protein containing fractions, identified by immunoblot, were 
pooled, the pH adjusted to 8.0 with 1 M Tris and after the addition of 0.25 vol buffer 2 
25 (^0 mM Tris/HCl, 0.02% NaN3, 10 mM DTT, pH 8.0) loaded onto a Q-Sepharose 

.Pharmacia) column (2 x 9 cm). This column was developed with a linear KCl gradient 
from 100 mM to 300 mM KCl. Eluted Statl proteins were precipitated with solid 
(NH,).SO, to 60% samration. The concentrated Stat proteins were dissolved in - 10 ml 
of buffer 3 [50 n^ Na.HPOVNaH.PO. pH 7.2, 2 mM DTT, 1 mM EDTA. Complete™ 
30 protease inhibitors]. N-ethyl-maleimide (Sigma) was added to a final concentration of 20 
mM The alkylation reaction mixture was incubated at room temperature for 10 mm and 
then placed on ice for another 30 min. The reaction was stopped by the addition of /3- 
niercaptoethanol to 50 mM and (NH,).SO. to 0.5 M. The reaction mixture was then 
loaded onto a low substituted Phenyl-Sepharose (Pharmacia) column (2 x 15 cm) 
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equil.brated in buffer 4 (20 mM Tris/HCl, 2 mM DTT, pH 7.4) + 0.5 M amn^onrnm 
sulfate and rhe Stat proteins were elated wuh decreasing (NH.),SO, m buffer 4. (The Stat 
protems eluted at about 300 mM salt). Fractions of .merest were pooled, concentrated by 
centrtprep 50 (Amtcon) to about 10 mg/ml and applied to a SUPERDEX 200 column (XK 

5 16 Pharmacia) equthbrated in buffer 5 (20 mM Hepes/HCl. 0.02% NaN3. 2 mM DTT, 
0 3 M KCl. pH 7.2). Fractions containing Statl« or Statl/3 were pooled. Both Statla and 
Statl/3 eluted very early, e.g. with a volume typical for globular protems of M. 350 kD. 
The pooled fractions were then concentrated by ultrafiltration to approximately 20 mg/ml 
and quick frozen on dry tee. The purified proteins were stored at -70"C. All buffers used 

10 during protein purification were chilled, thoroughly degassed and Hushed with N, before 
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Expression and purification ofStatltc. The portion of the human Statl gene encoding 
residues 132-713 was amplified by PCR ( Vent-Poly merase). The ^f^^^^ 
15 used: 5--dGGGAATTCCAlAIGAGCACAGTGATG-7^/U3A^^^^^ 

V-dr GGATCC TATTAGTGAACTTCAGACACAGAAATCYrestriction sites underlined). 

The product was cloned into the Ndel/BamHI sites of the pET20b expression vector 
(Novagen). N-terminal sequencing revealed the absence of the methionine residue 
introduced with the Ndel restriction site. Growth and induction of transformed E. coli 
20 [BL^1DE3 (pLysS)] was as described (Studier and Moffatt, 1986). About 50% of the 
induced protein remained soluble and was subsequently isolated. Cells were collected by 
centrifugation (20 min; 4''C; 20,000g) and resuspended in ice cold extraction buffer (100 
ml/30 g cells; 20 mM Hepes/HCl, 0.1 M KCl. 10% Glycerol, ImM EDTA, 10 mM 
MnCU, 20 mM DTT. 100 U/ml DNase I (Boehringer Mannheim), Complete^^' protease 
25 inhibitor. pH 7.6). Cells were lysed by three cycles of freeze/thawing. Lysis was 

continued at 4°C while stirring slowly for 1 h. The lysate was centrifuged for 20 mm at 
000 X o at rC. Polyethylemmine (0.1% final; S.gma) was addedao the supernatant, 
the solution gently mixed and centrifuged for 15 min at 15,000 x g. All subsequent steps 
were performed in the cold (4"C) unless stated otherwise. The supernatant contaming 
30 soluble Statue was precipitated with saturated ammonium sulfate solution (ultrapure; 

Gibco) in two steps (0-35%; 35-55% saturation final). The 35-55% pellet was redissolved 
in 20 ml of buffer 3 (see above) and alkylated as described above. The reaction was 
stopped by the addition of /3-mercaptoethanol to 50 mM and solid ammonium sulfate to 
0 9 M The mixture was loaded onto a Fast Flow Phenyl-SEPHAROSE column (low 
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substituted, 2 X 15 cm) that had been equ.librated in buffer A (50 mM Tris/HCl, 1 mM 
EDTA. 0.02% NaN.. 2 mM DTT, pH 7.4) + 0.9 M ammonium sulfate. After washing 
the column a linear gradient from 0.9 M to 0.05 M ammomum sulfate in buffer A was 
applied. Statue elated at about 0.5 M salt and the Statltc containing fractions were pooled 
5 and dialysed overnight against 2 x 4 liters of buffer B (40 mM Mes/NaOH, 10% 

Glvcerol, 0,5 mM EDTA, 0.02% NaN,. pH 6.5) + 140 mM KCl. This material was 
loaded onto a S-Sepharose column (5 x 5.5 cm) and a linear 500 ml gradient of buffer B 
containing 140 mM to 300 mM KCl was applied. The protein eluted in at approximately 
-^20 mM KCL. Fractions of interest were collected and dialysed against 3 liters of buffer 
10 C (50 mM Tris/HCl, 10% Glycerol, 2mM DTT, pH 8) + 50 mM KCl with one change 
of buffer. The protein solution (in buffer C + 50 mM KCl) w_as then applied to a Q- 
Sepharose column (2 x 9 cm) and bound proteins were eluted with a linear gradient from 
50 to 300 mM KCl in buffer C. Fractions with Statltc were combined and precipitated 
with soUd ammonium sulfate to 55% saturation. At this stage the 95% pure preparation 
15 could be stored at -20"C until subjected to in vitro phosphorylation (see below) or was 
directly loaded onto a SUPERDEX 200 gel filtration column (XK 16; Pharmacia). In this 
case the precipitated protein was dissolved in about 2 ml of 10 mM Hepes/HCl, 100 mM 
KCl 2 mM DTT, 0.5 mM EDTA, pH 7.4 and gel filtrated in this buffer. Statltc eluted m 
a symmetrical peak and was concentrated to about 20 mg/ml (Centriprep 50), quick frozen 
20 on dry ice and stored at -70"C . Typically yields of 40-50 mg (greater than 98 % pure as 
judged by Coomass.e stain and mass spectroscopy) Statltc from 6 liters of starting culture 
could be obtained. 

Determination of protein concentrations. Purified proteins were quantitated by UV 
25 spectroscopy. The extinction coefficient e in a 1 cm path length for a 1 mg/ml solution of 
protein can be estimated by the formula [(5700 x W + 1300 x Y)/M,] with W = number 
of tryptophans; Y = number of tyrosines and = molecular weight (Cantor and ^ 
Schimmel, 1980). The following extinction values (mM 'cm '> were used: Statla: e - 
1.25; Statl^: e = 1-31; Statltc: e = 1.27. 



30 



Proteolytic digestion of Statla and amino-terrrnnal sequencing of fragments. Proteinase K 
and subtilisin (Sigma) digests of purified Statla were carried out for 30 minutes on ice. 
The protein was digested at the concentration of 4.5 ,M in 50 ^1 of cleavage buffer which 
contained 20 mM Hepes./HCl. 50 mM ammonium sulfate, and 10 mM MgCU, pH 7.4. 
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Reactions were stopped by the addition of PMSF (2 mM final) and SDS-sample buffer. 
The proteolysis was resolved on a 10% or 16.5% SDS PAGE gel, which was euher 
stained wuh Coomass.e blue or electro-transferred onto a PVDF rBembranedmmobtlon 
ps- MiUipore;. Sequencing of the anuno terminus of the 65 kDa protease resistent Static 
5 fragment was performed as described by LeGendre and Matsudaira, ( 1988). Am.no 
terminal sequence analysts was performed by the Protein /DNA fac.Uties at The 
Rockefeller University. 

Cyanogen brormde and Endoproteinase AspN digests .ith mass spectrometnc peptide 
10 analysis. Cyanogen bromide (Sigma) digests were performed on 90 pmol of recomb.nant 
protein in 50% formtc acid at 25»C in the dark. Endoproteinase AspN (sequencing grade; 
Boehringer Mannheim) digests were earned out on 100-150 pmol of protein in either 25 
mM Tris/HCl (pH 7.5) or 10 mM ammonium phosphate buffer (pH 8) with 150 mM KCl 
at 25°C The protease:protein ratio was 1:50 by weight, e.g., 0.2 ^g: 10 Mg- Matrix- 
15 assisted laser desorptton/ionization mass spectrometry (MALDI-MS) was used to evaluate 
the peptide fragments. Aliquots (0.5 mD of the digest were taken at various intervals (1 
min to 7 hours), directly mixed into the MALDI-MS matrix solution (Cohen and Chait, 
1996), and subject to MALDI-MS analysis in a procedure reported earlier (Cohen et al., 
1995). 
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Preparation of EGF-receptor kinase and in vitro phosphorylation of Stat proteins. Human 
carcinoma A431 cells were grown to 90% confluency in 150 mm diameter plates in 
Dulbecco-s modified Eagle's medium supplemented with 10% bovine calf serum 
(Hyclone) Cells were washed once with chilled PBS and lysates were prepared in 1 ml 
25 ice cold lysis buffer (lOmM Hepes/HCl, 150 mM NaCl. 0.5% Triton X-100, 10% 

Glvcerol. 1 mM Na3VO„ 10 mM EDTA, Complete™ protease inhibitors, pH 7.5). After 
10 min on ice. the cells were scraped, vortexed and dounce homoge^uzed (5 strokes). The 
lysates were cleared by centritugation at 4°C for 20 min at top speed in an Eppendorf 
microfuge and stored at -70"C until needed. Immediately before use 1 volume of the lysate 
30 was diluted with 4 volumes of the lysis buffer ("diluted lysate"). 

EGF-receptor precipitates were obtained by incubating 5 ml of diluted lysate with 50 of 
an anti-EGF-receptor monoclonal antibody directed against the extracellular domain. 
After 2 hours of rotating the sample at 4°C, 750 ^1 of Protein-A-agarose (50% slurry; 
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Oncogene Scence) was added, and the incubation was allowed to proceed, while ro.at.ng, 
for another 1 hour. Agarose beads containing the EGF-receptor inununoprecip.tates were 
then washed 5 times wuh lysis buffer and finally twice wUh storage buffer (20% Glycerol. 
.0 mM Hepes/HCl. 100 mM NaCl, 0.1 inM Na3V0.). Precipitates from 5 ml diluted 

5 lysate were dissolved in 0.5 ml storage buffer, flash frozen on dry ice and stored at -70°C. 
immediately before an m vuro kinase reaction the Protein-A-agarose bound EGF-receptor 
from 5 ml dilute lysate was washed once with Ix kinase buffer (20 mM Tris/HCl. 50 mM 
KCl 0 3 mM Na^VO,, 2 mM DTT, pH 8.0) plus 50 mM KCL and then dissolved in 0.4 
ml (total volume) of this buffer. Afterwards the washed EGF-receptor precipitate was 

10 incubated on ice for 10 min in the presence of a final concentration of mouse EGF of 0. 15 
ng/Ml. Phosphorylation reactions were carried out in Eppendorf tubes in a fmal volume of 
1 ml To the pre-incubated kinase preparation the following was added: 60 ;.l lOx kmase 
buffer 20 ^1 0.1 M DTT, 50 ^1 O.l M ATP, 4 mg Stat protein (SUPERDEX 200 eluate 
for Static and Statl |3; ammonium sulfate pellets dissolved in [20 mM Tris /HCl, pH 8.0] 

15 for Statltc). 10 ^1 IM MnCl, and dHp to 1 ml. The reaction was allowed to proceed for 
15 hours at 4°C. After 3 hours an additional 15 ^1 of 0.1 M ATP was added. 

Separation of phosphorylated from unphosphorylated Stat proteins. The irt vitro kinase 
reaction mixture (see above) was freed from EGF-receptor bound to agarose beads by 
20 spinning through a plug of siliconized glass wool at the bottom of a pierced Eppendorf 

tube. The glass wool was washed with 0.5 ml HA-buffer (20 i^ Tris/HCl. ImM EDTA, 
2 mM DTT pH 8 0) and the pooled volumes loaded onto a heparin agarose (Biorad) 
column (1.5 x 7 cm). The column was washed with 50 ml HA-buffer, and then the bound 
Stat proteins were eluted with two consecutive 50 ml volumes of HA-buffer + 150 mM 
25 KCl and then HA-buffer 4-400 mM KCl. Unphosphorylated proteins (eluted with 150 mM 
KCl) were concentrated by ultrafiltration to about 10 mg/ml, flash frozen on dry ice and 
stored at -70^C. Phosphorylated Static, and Static was concentrated to Img'ml, Glycerol 
was added to 50% (vol/vol) and the material was stored at -20"C. Phosphorylated Statltc 
was brought to a concentration of about 15 mg/ml and run on a SUPERDEX 200 columns 
30 under the conditions described above for the native protein. The gel filtered 

phosphorylated Statltc was pooled, concentrated to approximately 20 mg/ml, flash frozen 
on dry ice and stored at -70°C. 

Electrophoretic mobility shift assays (EMSA). A 12.5 , 1 reaction volume contained DNA 
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binding buffer (20 mM Hepes/HCl, 4% F.coll 40 mM KCl. 10 mM MgCl, 10 mM 
CaCK. I mM DTT) radiolabelled DNA (see below) at a final concentration of 1 x 10"' M 
unless stated otherwise, 50 ng dIdC, 0.2 mg/ml BSA (Boehringer Mannheim), and the 
indicated amount of purified phosphorylated Statl. The reaction volume was mixed and 

5 then incubated at room temperamre. The time necessary to reach equilibrium was assessed 
by EMSA [(Stone et al., 1991)1. For all DNA fragments tested, equilibrium mrned out to 
be fully established at the earliest timepoint that can be determined by this technique (30 
sec). Therefore incubation periods of 5-15 minutes were chosen. Reaction products were 
loaded onto a 4% polyacrylamide gel (1.5 mm thick) containing 0.25 x Tns-borate-EDTA 

10 which had been pre-run at 20V/cm for 2 hours at 4«C. Electrophoresis was continued for 
60 minutes at 4"C. Gels were dried and exposed to X-ray film and quantitated by a 
Molecular Dynamics Phospholmager. 

Binding site oligonucleotides. Single-stranded oligonucleotides that were purified on the 
15 basis of trityl affinity were obtained from The Great American Gene Company (Ransom 
Hill). Oligonucleotides longer than 30 nucleotides were further purified on 6% 
sequencing gels and DNA recovered by soak elution and ethanol precipitation. Nucleic 
acid concentrations were determined by absorbance at 260 nm using the calculated molar 
extinction coefficient for each oligonucleotide (corrected for the hyperchromic effect). 
20 Complementary oligonucleotides at a concentration of 1 pmol/^l were hybridized for 3 

hours after thermal denaturation in 5 mM Tris/HCl, 50 mM KCl, 10 mM MgCl,, pH 8.0. 
One pmol of synthetic duplex molecule was labelled to high specific activity by the 
Klenow fill-in reaction (0.5 mM dATP (and 0.5 mM dCTP for SI). lOO^Ci [a^P] dGTP 

(3000 Ci/mmol; lOmCi/ml; and [a^^'P] dTTP for SI; Du Pont), 5 Units of ExoKlenow 
25 enzyme (New England Biolabs)) and rendered completely double-stranded with a 0.5 mM 

dGTP (and 0.5 mM dTTP for SI) cold chase. Unincorporated nucleotides were removed 

by gel filtration (spin quant columns; Pharmacia) in 10 mM Tris,'HCl. 100 mM NaCl, 1 

mM EDTA, pH 8.0. Labelled oligonucleotides were stored at 4°C. 

30 The following duplex DNA fragments with protruding 5'-TCC (except for SI which has 
5'-GATC) were used (tjae core reco^mtioji ^equence is underlined): cf^T 

dGTAIICCCm^ATG^^-^^^ ) 
V-dr.AT TTCCCGTAA ATCAT-3!f siyidbTTGitCCGG^ 2x cfo^^^^^-- , J 

(10 bp spacing) 5'-dAGTCAGIICCCGICAATGCATCAGGIICCCGICAATGCAT-3;, 
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2x cfosWKS bp spacing) S'-dAGTCAGTTCCCGICAATGAGIICCCCT^AATGCA^ 

2x cfosWT (15 bp spacing) -x: As -^^-^ I J 

5 -dAGTCAGIICCCGICAATGATCGCTACAGAG TTCCCGTCA AGCA-3\ 2x 

cfosWT (inverted repeat) / '->(_, cZl \j '■ lU ) 

5 5'-dAGTCATIICCCGICAATGCATCAGTIGACGGGAAAGTAGT-3\ 



Dissodation rate determination. Under the reaction conditions described above, each 
oligonucleot.de (at 2 x 10 ' M or otherwise stated) was mixed with 0.55 x 10' M d.mer of 
purified phosphorylated Statl protein. The reaction volume was scaled up to 100 The 
10 reaction was incubated for 5-15 min at room temperature and for time zero, an aliquot (10 
,.1) was removed and loaded directly onto a pre-run polyacrylamide gel (see above). 
Afterwards, a 100 x molar excess of homologous unlabelled DNA (in less than|%,of the 
reaction volume) was added. At subsequent time points (indicated in Figures^, 6 and 7) 
10 Ml aliquots were withdrawn and also loaded onto the running gel (at 10 V/cm). After 
15 entering the final time point (after 30-45 min), electrophoresis was continued at 20 V/cm 
until the unbound labelled DNA-fraction reached the bottom of the gel. Gels were dried, 
exposed to X-ray film and labelled protein/DNA complexes and unbound labelled DNA 
were quantitated as described above. The half life was determined from a semi-log plot of 
the numerical data (shifted radioactivity over shifted radioactivity at time zero versus 
20 time). For many sequences studied, the half life was too short ( > 30 sec) to be 

determined by EMSA. All experiments were performed at least twice with the different 
oligonucleotides. 

Determination of apparent equilibrium constants for protein: DNA interactions. A fixed 
25 quantity of "P labelled oligonucleotide varied between 1 x 10- M and 5.6 x 10- M in 
three separate experiments, was titrated against a standard protein dilution senes (common 
to all oligonucleotides tested) in a volume of 12.5 ^1 under the reaction conditions 
described above. Numerical data were used to construct a standard binding curve from 
which the free dimer concentration, when 50% of the probe is shifted, could be 
30 determined. 

Results : 

Production by recombinant techniques and purification of Statl : cDNA encoding Statla 
or Statl/3 was inserted in baculovirus transfer vector (pAcSG2) and co-transfected with 
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„,„.,f,c<i Wnearued AcPNV bacu.ovirus DNA ,o produce v.rus pan.des. ,ns=c, cells <Sf9 
cells, infece. .he respecive r.co™.ina„, baculovl.s produce, a 9. .Da pro.,„ and 

blo, analysis. These pro,e,„s were purrfied <Fig lA, ihrough ,he s.eps indicared ,n Table 
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Table I 

Purification of Stat la//3 





STEP 


VOLUME 
(ml) 


PROTEIN (mg) 


I 


Crude Extract" 


80 


550" 


II 


S-SEPHAROSE . 


120 


30- 


III 


Q-SEPHAROSE 


30 


12* 


IV 


Ammonium Sulfate 


1 


8" 


V 


Alkylation 


10 


8" 


VI 


Phenyl-SEPHAROSE 


25 


6= 


VII 


SUPERDEX 


3 


5'= 



" Following precipitation at pH 6.2 from 5 x 10' cells. 

" Protein concentrations were determined by the method of a dye-binding assay 

(Bradford, 1976) using bovine serum albumin as the protein standard. 

= Protein determined by ultraviolet light absorbance as described in METHODS. 
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Statla is 750 amino acids long. Statl/3 is a product of a differentially spliced mRNA 
which encodes a protein 712 amino acids long (Schmdler ecal., 1992; Yan etal.. 1995). 
It is known that both Statla and 1/3 can be phosphorylated on a single tyrosme. residue 
701. In VIVO, both forms of the protein dimerize upon phosphorylation, and then 
5 translocate to the nucleus to bind specific DN A sites (Shuai et al. ,1992; Shuai et al. , 
1993 a). 

The purified Statla was digested with several proteolytic enzymes to determine whether 
the protein could be divided into functional domains. Both subtilisin and proteinase K 

10 produced two major digestion products (Fig. IB), the largest of which migrated on SDS 
polyacrylamide gel electrophoresis with an estimated size of 65 kDa, as compared with the 
full length protein of 91 kDa." (Cleavage products of approximately 40 and 30 kDa were 
also seen). The 65 kDa product had an N-terminal sequence of XTVMLDKQEKE 
indicating that it resulted from cleavage between residues 131 and 132 of the full length 

15 protein. A single prominent smaller fragment of about 16 kDa was also observed. This 
fragment was the only one generated that retained reactivity with an antibody raised 
against the amino terminus of Statl. The shorter 16 kDa fragment was therefore 
identified as an N-terminal fragment of the molecule. 

20 The major proteolytic cleavage fragment, which was also the longest, began at residue 
132. This fragment was poorly recognized by an antibody to the carboxyl terminal 38 
amino acids of Statla which indicated an additional cleavage near the carboxyl terminus. 
A bacterial expression clone encoding residues 132-713 was prepared since this fragment 
was shown to be resistant to further proteolysis (above), and Statl/3, which terminates at 

25 residue 712, is known to be active form of the protein in vivo. The product, Statl (132- 
713) or Statltc. was expressed in relatively large quantities in E. coH and a major fraction 
of the protein proved to be soluble. Statltc was purified to homogeneity (Fig. 1 A and as 
in the Materials and Methods, above). The recombinant truncated Stat protein of the 
present invention appears to be a unique form of Stat protein, since the Stat fragments 

30 listed in Table II were found to essentially accumulate entirely in inclusion bodies. 
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Table II 



Solubility of Recombina nt Stat 1 Fragments 



AMINO TbRMlINUi 


CARBOXYL 
TERMINUS 


SOLUBLE 


132 


713 


YES 


200 


713 


NO 


250 


713 


NO 


300 


713 


NO 


370 


713 


NO 


420 


713 


NO 



The expression vectors for the nucleic acids coding the amino acid 
sequences for the protein fragments of Statl, listed above, were constructed and 
expressed as described in the METHODS for the truncated protein Statl, Statltc. The 
sequences are based on the Statla as described above. The positive (YES) denotation for 
being soluble, is indicative of significant quantities of the corresponding protein fragment 
being free of the inclusion bodies. As can be seen from the table, only the truncated Stat 
protein of the present invention (132-713) was found to occur free of inclusion bodies in 
significant quantities. 
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A,greganon of naUve proteins: It appeared possible that aggregatior. of the protem 
occurred since pur.f.ed Static. Static ar^d Statltc eluted .n peaks with broad lead.ng 
shoulders, durmg ge. fltration. Th.ol crosslir.lc.ng was suspected as the cause, stnce the 
preparation had aggregates that migrated with an apparent molecular mass correspondmp. 

5 to dimers and h.gher order oligomers when run under non-reducing condutons on a 

denaturing polyacrylamide gel (not shown). Accordingly, to block the react.ve thiols, the 
cell extracts (from baculovirus infected Sf9 cells for Static and transformed E. coh for 
Statltc) were mcubated with N-ethyl maleimide (NEM) to test tf the modification of the 
cysteine residues: (I) could prevent the aggregation, and (2) whether such modification 

10 would lead to a non-aggregated protein preparation that retained its functional properties. 
The procedure worked unexpectedly well and this alkylation s_tep became part of the 
purification procedure (Table 1). 

The purified protein was cleaved with cyanogen bromide and Endoprotease Asp-N. Mass 
15 spectrometnc analysis of the resulting peptides showed that cysteines 155, 440. and 492 
were alkylated by the NEM treatment, whereas two other cysteines were not (Cys 5.2 and 
Cys 577). The NEM treatmem did not affect any of the subsequent experiments (e.g.. 
DNA binding, see Fig. 3B) and was adopted as the standard preparation of a 
homogeneous protein. 

m .uro pkospkoryianon of Statla. StatW and truncated Statl by tHe EGF-receptor. The 
in vivo activated DNA binding form of Statl is phosphorylated on tyrosine 701 when 
isolated from mammalian cells treated with ligands that activate either JAK kinases or 
transmembrane receptor kinases (Shuai era/.. 1992; Shuai a/. , 1993b). EGF-receptor 
25 kinase activity was achieved with immunoprecipitates of membrane preparations from 
cultured human A431 cells that express 5 x 10^ EGF-receptors per cell (Yarden et ai, 
1985- Quelle et al.. 1995). These membrane preparations were use4 as the source ot 
enzyme to catalyze the tyrosine phosphorylation of Statl and the truncated Statl. 

30 As detailed above, m vivo, Statla is phosphorylated on a specific tyrosine residue 

(Tyr701) The resulting phosphorylated form of the protein runs at a slightly slower .at. 
during polvacrylamide ge, electrophoresis, in comparison to the nonphosphorylated form 
(Shuai et al 1992) This same change in mobility was observed after purified Static v. 
treated in vUro with EGF-receptor kinase preparations. In addition, when the enzymat.c 
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reaction was carried out in the presence of -P7ATP, the slower running protein was 
found to contain (Figure 2A). Similar results were obtained for the vitro 
phosphorylation of Statltc. However, it was clear that not all of the Statl protein was 
phosphorylated (Figure 2A). Although subsequent experiments yielded somewhat higher 
5 amounts of phosphorylation, the percentage of Stat protein that was phosphorylated never 
exceeded 75 % . 

Therefore a method of separating phosphorylated from nonphosphorylated Stat protein was 
required, Although the phosphorylated protein forms a dimer. this dimer elutes in a peak 
10 strongly overlapping the elution peak of the corresponding nonphosphorylated monomer. 
Therefore, alternative means was required. After many unsuccessful attempts using 
various chromatography procedures, step-wise elution ot>ej^tein mixmre bound to 
heparin agarose proved surprisingly successful (Figure-^^).-this novel procedure resulted 
in a separation of two peaks containing Stat proteins (eltjedjn^eps of 150 mM and 400 
15 mM KCl). The tyrosine phosphorylated protein (Figure^ wWch, in addition, had DNA 
binding activity, was present in the second of these two chromatographic peaks. 

To determine the purity of the isolated material and to analyze whether the correct 
tyrosine residue was phosphorylated, both purified, unphosphorylated {i.e.. protein not 
20 reacted with EGF-receptor) and phosphorylated protein (/. , protein obtained from the 
chromatographic peak containing phosphotyrosine from the heparin agarose column 
described above) were subjected to Endoprotease Asp-N d^tion and the resulting 
peptide fragments analyzed by mass spectrometry (Figure^Sq). Phosphorylation increases 
the molecular mass of an unphosphorylated fragment by 80 daltons, that is, comparison of 
25 the Asp-N fragments of phosphorylated versus uphosphorylated Stafs showed an 80 
dalton shift of the fragment 694 - 720 (Figure^2S, demonstrating that in vitro 
phosphorylation by EGF-receptor kinase occurred exclusively on the, single tyrosin^. 
residue that is phosphorylated in the cell. In addition, the bottom panel of Figure^^ 
demonstrates the absence of unphosphorylated Tyr 701 in the purified EGF-receptor 
30 kinase-treated protein. 

Both in vitro phosphorylated Statla and Statltc bind specific DNA fragments: 
Electrophoretic mobility shift assays (EMSA) (Fried and Crothers, 1981: Garner and 
Revzin, 1981) were used to test DNA binding of tyrosine phosphorylated Statla and 



67 



Statltc Both protems were found to bind to all tested labelled deoxyoHgonucleottdes 
kr.owr from earlier studies to bind Statl (the oHgo cfosWT is illustrated m Figure 3A). 
The bound complexes were not affected by N-ethyl maleimide indicating that alkylat.on ot 
cysteine does not affect DNA binding (Fig. 3B). This result is consistent with earlier 
5 experiments showing that ISGFSc, now known to be a Statl:2 heterodimer. is not affected 
by NEM treatment (Levy et al., 1989). In addition, the DNA binding ability of 
homodimeric phosphorylated Static or its truncated form was highly resistant to up to 2 
Molar monovalent salt concentrations. 

10 strength of Statl binding to DNA and estimation of dissociation rates. We next used the 
EMSA assay to obtain an estimate of the binding affinity of Static and Statltc to DNA. 
Both forms of the protein behaved identically when using a gc,d amou^^^ 
deoxyoligonucleotide and increasing protein concentrations (^i^)- A K,, of 
approximately 1 x 10' M was estimated from this data for both proteins when the bound 
15 and unbound fraction of DNA was compared as a function of protein concentration. This 
IS in the affinity range for transcription factors in general which have been reported to 
have a K between 10 ' and 10-^ M for proteins with the highest affinity for their cognate 
DNA sitel (Riggs et al., 1970; Affolter et al., 1990). The same results were obtained 
with several different oligonucleotides, the Ly6E and cf^WT Stat binding sites, which are 
20 "weak" binding sites, and "strong" sites, such as the selected optimum site. SI (Horvath et 
al 1995) and a mutated cfo^sequence (M67 site; Wagner et al., 1990). ["Strong" and 
"weak" in this context refer to experiments with cell extracts containing activated Statl 
which binds more of some oligonucleotides (strong) than others (weak).] 

25 The stability of preformed DNA protein complexes were examined by the following 
method: the formation of a complex between protein and labelled oligonucleotides is 
allowed to occur and unlabelled oligonucleotides are added m vast molar excess after the 
reaction reaches equilibrium. At various times after the addition of unlabelled competitor 
DNA aliquots are layered on a running native polyacrylan.ide gel to determine free and 

30 bound oligonucleotides. This type of experiment was carried out with both Static, and 
Statltc and with two different labelled DNAs, the namral cfos site, an example of a 
"weak" site, and the mutated cfos-promotor element (M67) an example of a "strong" site. 

With the "weak" site, the "off" time was so short that the addition of unlabelled 
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„„c,eo,i.^ to, a. „..le .s 30 seconds reeved a„ prefo^e. pro.e.n DNA complexes 
<Fi.„„,fe- S,a,l« .how„ in ,he l.f. pa«l). Wi.h ,he -STOng ' site, .he Pretoria 
Jne:co.p.exe, »ere d.p.acea .ore s,o.,y. ..e ..P. is es.™a,ed ,o .e 3 ™nu,es 
,P„„„ 5B, ng. pane, en,p,o,e. S.a.UC ,n .hese e.pen™en,s ... was no .rference 

5 between Statla and Statltc. 

S,a, U.^,n, ,0 ,We™ UNA si.es: E.Mence fo. s,a,i,ae, ,ro,no,cr ^cupa.c, ,Hrou^H 
pro.em:pro,em Mer^tions of Sml. and StcW «rsus SmUc- 
TWO recen, repom on pron,oters of genes dependen. for .ranscrip,ion on S,a, pro.e.ns 
10 have indicated ,ha> cwo neighboring S.ar b.nding si.es are required for max.ma. 

„a„scr,p,io„a> scrnruiarion, ,n one of rhese reporrs rhe hu„.n mig gene pro^ore. was 
Zd : have ,wo wea. S,a,. binding si.es wi.hin .3 bp. ne,.her of which aione — 
,FN-, .,ar.crip.,onal ac.,va.io„ while bo.h s..es .oge.he, did so. Moreover .he ac rve 
eL „. fornred complexes wi.h S.a.f pro.ein .ha, .igrared .ore s.oWy ,h.n S,a„ d . rs 
„ bound ,0 DNA. The au.hors suggesred .ha. in,erac„on he.ween — 

occur in .he complexes (Guyer « a,.. 1995). ,n addi.ion. we recen.,y reporred .wo D- . . 
Lrng si.es were found ,„ .he seg.e„. of ,he ...M P-o.er .ha. d.recs s.r.pe 3 
fomaarion in Dlfi^ enrbryos; bo.h si.es were required tor maximun, s.r.pe 
expression {Ymetal, 1996). 

W,.h .he presen. de.ons,r..io„ .ha. S.a.,„ pro,e,n indeed does have such a rapid off-„.e. 

o,ig„„uc,eo.idesco„,a,ning,wo weak DMA binding s,.es was inves„ga,ed. The 

eri.en,s were carried ou. wi.h bo.h S.a.,„ and S.a.,.c and a ,abe„ed o„gonue,eo„d 
„ ZZ. a varie,y of arrange,„e„,s of .wo -weaK- binding si,es. Wi,h ,wo bind.ng s es 
p Ln, in ,ande. on ,he sa.e ONA frag^en, and a. a .odera.e.y h.gh concen,ra.,or, 
prcein .0 55 . ,0' M). S.a„« and S,a.,.c each formed bo.h a homW.n>er comp.ex an. 

add, ona, con,p,ex .ha. ..gra.ed .ore sfowiy .x ,d,n,eric>,. — -^^^ 
siower n,ov,ng co,„piex sugges.ed occupa.io„ of bo.h ONA bind.ng s,.es. . ca, one 
30 DNA mo,eeu,e wi,h .wo S.a. dimers bound .o i. (F.gure 6A. ..me zero,. When su h 

Zils were chaiienged for various .i.es w.,h an e.xcess of un,abe„ed „,igo„uc,eo.,de, 
ZL di.er,c and .x <din,eric„ complexes were d,spe„ed bu. w,.h d„^^^^^^^ 
for S.a„„ and S,a,„c. The S,a.„c showed a.mos. in^ed.aie d.sp,acen,e . (iess 
.,„u.e, of bo.h dimeric and P-x ,d,™r,c), co.p,exes (Figure 6A, ,ef.,, ,n con.ras,, 
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whereas as anticipated, the Statla hon.od>mer also disappeared quickly, the [2x (dimer,c)3 
complex required more thar. 30 mtr. for partial dtsplacemer^t, mdicattng a s.gn.t.cant 
increase in stabiUty of this larger complex with the full ler^gth proteins. 

5 These results suggested that when Static is bound at tandem b.nding sites. prote.n:protein 
interactions occur that require the presence of the amtno and/or carboxyl termtnal domam 
of Statla to form the more stable DNA.protein complexes. To examine thts question we 
compared Statltc ,n the chase assay with the Statl/3 protein, wh>ch only lacks the C- 
terminal domain. As shown in Ftgure 6B, Static exhibits the same behavior as the full 

10 length protein, indicating involvement solely of the amino terminal region (between ammo 
acids 1 and 131) in stabilizing the [2x (dimeric)] complexes. ^ 

We then tested the importance of the orientation and the spacing of the two Stat binding 
sues within the synthetic oligonucleotides. First the DNA sites that exhibited stabilization 

15 m [2x (dimeric)] binding were changed from tandem (- -*) to inverted (- -), keeping 
the spacing at 10 basepairs (bp) between the two binding sites. While both 
oligonucleotides were capable of binding two dimers (with the tandem binding sites m 
inverted orientation showing much less of the [2x (dimeric)] complex even at relatively 
high protein to DNA ratio), the inverted sites showed no increased stability when 

20 challenged with unlabelled oligonucleotide (Fig. 7A). 

Oligonucleotides with tandem binding sites spaced by 5 or 15 bp were prepared to 
compare with the original oligonucleotide with 10 bp spacing. The oligonucleotide with a 

15 bp spacing behaved indistinguishably from the one with 10 bp spacing, while the 
25 oligonucleotide with 5 bp spacing showed much less evidence of enhanced stability of the 

[2x (dimeric)] complex, suggesting that protein:protein interaction was less likely when the 

DNA spacer was of inadequate length (Figure 7B). 

Discussion : 

30 The production of three purified Statl protein preparations from recombinant DNA 

constructs was achieved: Statla and Statl/3 from baculovirus infected insect cells, and a 
Statltc from E. coli. Digestion of purified Statla protein suggested a compact domain in 
Che amino terminus of 131 amino acids and a relatively protease-resistant large carboxyl 
terminal fragment (132-712). Activated EGF-receptor partially purified from membranes 
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by .n^unoprecipUation was capable of .Uro catalysis of the phosphorylation of tyros,r. 
701 of Statl.. Static, and Statltc. This is the same tyrosine that ts phosphorylated tn 
.V. by eUher IFN-a, IFN-. or EOF treatment of cells (Shuai , 1992; Shua, et al. , 
1993a) Th>s .Uro approach was more efficient ,n generating activated Statl molecules 

5 than prevtous attempts that employed either co-infect.on of Statl and a JAK kmase m the 
baculcvirus/insect cell system ^n n.o, or in .itro kinase assays wtth JAK k.nases 
[unpublished observations and (Yan H. et al, 1996)]. These results on .nro 
phosphorylation of the protetn plus allcylatton to prevent aggregation, coupled wtth an 
adequate chromatographic protocol, allowed the puriftcatton of milligram quantu.es of 

W activated prote.n. These techniques are also be applicable to other Stat molecules such as 
Stat 2, 3, 4, 5A, 5B, and 6. 

Analysis of the peptides derived from the purified phosphorylated protein by mass 
spectrometry, did not reveal significant contamination with unreacted Stat monomers. A 1 
15 three tyrostne-phosphorylated Statl derivatives dimerized and, as tested by EMSA. bound 
to the same DNA oligonucleotides previously shown to bind activated Statl m cell 

extracts. 

The structure of the Stat protein is expected to be complex considering the number of 
20 interactions these proteins must undergo. The region from residues 400-500 specifies 
DNA contacts (Horvath et al, 1995). while the carboxyl terminal half of the molecule 
contains the recognizable SH2 and putative SH3 domains (Fu, 1992; Schindler et al, 
1992) and the carboxyl terminus comprises the transactivation domain (Muller et al, 
1993- Wen et al 1995). From the digestion by proteases which released an amino 
25 terminal and a carboxyl terminal fragment a compact structure for the amino terminal of 
about 131 amino acids, is indicated. In addition there is a large stable fragment beginning 
at amino acid 132 that can be phosphorylated on a specific tyrosine and dimenze. The 
Statl protem binds to various DNA fragments with a K., of 1 x 10^ M. Compared to 
other regulatory proteins this is a relatively modest affinity. Despite having similar 
30 apparent K., values, the binding with DNA may differ significantly in rates of association 
with and dissociation from the Stat protein. The Statl protein achieves equilibrium in 

viiv ftr nuicker (less than 30 seconds) than the EMSA technique 
DNA binding very rapidly, tar quicKer ucb^ uia 

can determine. When the stability of Statl protein preparations to the various Statl 
binding sites were examined, measurable differences became apparent. Although the 
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protein/DNA complex had a half life of no more than 3 minutes for any of the sues tested, 
the off times for different oligonucleot.des varied by at least s.x-fo.d. The difference 
between "strong" and "weak" oUgonucleotide binding as detected in gel sh.ft assays was 
found to be due to the rapid "off" t.me tn competition assays w.th the displacement from 
5 "weak" sites being essentially instantaneous. Regarding the DNA binding activities of the 
Stat dimer to a single recognition sequence, no differences between the full length Static 
and the carboxyl- and amino terminally truncated Statltc was observed. 

The new finding of great potential biological relevance in the.e studies concerns the 
W cooperative stabilization of Stat homodimers on neighboring binding sites. This was 
observed when two tandem sites (separated by 10 or 15 bp) were both occupied by 
homodimers. A large complex was formed consisting presutriably of two homodimers 
which was more stable to competition with unlabelled oligonucleotides than one dimer 
binding to a single site. This interaction required a mmimum spacing (greater than 5 
15 basepairs) between adjacent sites and was strongly orientation-dependent, i.e., it occurred 
only if both recognition sequences were in tandem. 

Additionally a domain in the Statl molecule required for this dimer:dimer interaction was 
determined. The Statl/3 lacking the carboxyl terminal 38 amino acids showed the same 
20 stabilization of the [2x (dimeric)] Stat complex on the DNA as the full length protein. 
However, the truncated protein Statltc that lacks the amino terminal 131 amino acids (as 
well as the carboxyl terminal sequence) formed the higher order complex less well, and 
this complex was not stabilized during oligonucleotide competition. Thus the ammo 
terminal 131 amino acids of Statl defined by proteolysis as a stable domain, and which is 
25 dispensable for dimer formation and binding to single DNA sites, participates in Stat 

dimer dimer interaction on tandem DNA sites. Interestingly, the isolated ammo termmai 
domain dimerizes in solution. The amino terminus of the Stats sho^^s rather mgh 
sequence homology (Schindler and Darnell, 1995). indicating that protein:protein 
interaction in this domain is of general importance in Stat function. Since there is 
30 evidence from the mig-gene (Guyer et al. , 1995) that neighboring "weak" Stat binding 
sites are required for a IFN-. response, it indicates that the interaction we describe has a 
biological role. 

• • . limited in ^cooe bv the specific embodiments describe 
The present invention is not to be hmited m scope oy y 
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herein Indeed, various modifications of the invention in addition to those described 
herein wUl become apparent to those skilled >n the art from the foregoing descript.on and 
Che accompanymg figures. Such modifications are intended to fall withm the scope of the 

appended claims. 

It is further to be understood that all base sizes or ammo acid sizes, and all molecular 
weight or molecular mass values, given for nucleic acids or polypeptides are approximate. 

and are provided for description. 

10 Various publications are cited herein, the disclosures of which are incorporated by 

reference in their entireties. 
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